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ABSTRACT 

A Density-Quantile  Function  Approach  to  Cliooslnj*  Order  Statlutice 
for  the  Estimation  of  Location  and  Scale  Parameters. 

(August  1979) 

Randall  tester  Eubank*  B.S.,  New  Mexico  State  University;  M.S.* 

New  Mexico  State  University;  M.Stat*  Texas  AAM  University; 

Chairman  of  Advisory  Coamlttee:  Professor  Emanuel  Parsen 

Parzen  (1979)  has  shown  that  the  estimation  of  location  and  scale 
parameters  by  linear  systematic  statistics  may  be  formulated  as  a 
problem  in  regression  analysis  of  a smoothed  sample  quantile  process. 
In  this  dissertation,  a general  approach  to  optimal  spacings  selection 
Is  presented  that  utilizes  design  techniques  for  continuous  parameter 
time  series  regression  developed  by  Sacks  and  Ylvisaker  (1966,  1968). 
This  methodology  is  applied  to  several  common  distributions.  The 
problems  of  optimal  order  statistic  selection  for  estimation  In 
censored  samples,  for  quantile  estimation  and  for  the  suamarlsatlon 
of  large  data  seta  are  also  considered.* 


ACKN0ULEDCQ4ENTS 

I wish  to  express  my  sincere  appreciation  to  Dr.  Emanuel  Parsen 
for  his  interest  and  cooperation  throughout  this  undertaking  and 
for  Che  invaluable  experience  of  our  association.  Without  his  Insight 
and  guidance,  this  dissertation  would  not  be  possible. 

My  greatest  personal  indebtedness  is  to  ay  wife,  Ellas.  She 
has  provided  constant  support  throughout  my  graduate  education.  Her 
assistance  In  the  recopylng  and  proofreading  of  this  manuscript  was 

N. 

invaluable. 

1 am  grateful  to  Dr.  W.  B.  Smith,  Dr.  L.  J.  Ringer,  Dr.  T.  E. 
Vehrly,  Dr.  0.  J.  Uartflel,  and  Dr.  Dan  Colunga  for  their  willingness 
to  serve  on  oy  advisory  connlttee.  A special  word  of  thanks  goes  to 
Dr.  V.  B.  Smith  for  his  temporary  service  as  my  advisor.  It  has 
been  both  a pleasure  and  an  honor  to  be  associated  with  the  faculty 
and  staff  at  the  Institute  of  Statistics.  The  interest  and  care  they 
show  in  their  students  helps  remove  many  of  the  obstacles  and  com- 
plexities associated  with  graduate  work  thereby  smking  It  a truly 
enjoyable  experience. 

Finally  I would  like  to  express  my  gratitude  to  Ha.  Linda  Bishop 
for  the  typing  of  the  manuscript,  to  Mr.  D.  L.  Hawklnt  for  his 
programsdng  aselstanee,  and  to  Mr.  Louis  Caston  for  hla  aid  with 
the  figures. 


vl 


TABLE  OF  CONTENTS 


TABLE  OF  COKTENTS  (CONTINUED) 


SECTION 


PAGE 


SECTION 


FACE 


1 INTRODUCTION 

1.1  Preliainarles 

1.2  Review  of  the  Literature 


1.2.9  Lognotwl  Disttlbutlon  . 

3.2.10  CoBparUon  of  Solutions. 

3.3  Data  Sumarles  for  Large  Saaplas 


1.2.1  Overview 

1.2.2  Noraal  Distribution  . . • 

1.2.3  Exponential  Distribution. 

1.2.4  Pareto  Distribution  . • • 

1.2.5  Cauchy  Distribution  . . • 

1.2.6  Logistic  Distribution  . . 

1.2.7  Wcibull  Distribution.  . . 

1.2.8  Extreme  Value  Distribution 

1.2.9  Gamma  Distribution.  « • • 


7 

8 
9 
9 

10 

10 

11 

11 

12 


1.3  Objectives  

SPACINCS  FOR  UNCENSORED  SAMPLES 


2.1  Preliminaries 

2.2  Definitions  and  Notation  

2.3  Regression  Design  for  a Brownian  Bridge 

Process 

2.4  Location  and  Scale  Parameter  Estimation  as  a 

Continuous  Parameter  Time  Series  Regression 
Problem * *j*  * 

2.5  .Selection  of  Optimal  Spacings  as  a Regression 
Design  Problem  for  the  Quantile  Process.  . . • 

2.6  Comparison  with  Other  Approaches  


APPLICATIONS 


3.1  Preliminaries 

3.2  Spacings  for  Some  Common  Distributions  . » . • • 


3.2.1  Normal  Distribution.  . . . 

3.2.2  Exponential  Distribution  . 

3.2.3  Pareto  Diatribotion.  • . • 

3.2.4  Cauchy  Dlatrlbutlon.  . . • 

3.2.5  Logistic  Distribution.  , . 

3.2.6  Wcibull  Distribution  . . . 

3.2.7  Extreme  Vnluc  Distribution 

3.2.8  Gamma  Distribution  # • . • 


41 

54 

56 

62 

69 

80 

64 

88 


4 SPACINGS  FOR  CENSORED  SAKFLES  AND  QUANTILE 

ESTIMATION 

4.1  Optlmsl  Spsclngs  for  Censored  Samples.  . . 

4.2  Optimal  Spacings  for  Quantile  Estimation  • 

5 CONCLUSION 

5.1  Sunary 

5.2  Problems  for  Further  Research.  ...... 




109 

109 

112 

115 

115 

116 

118 


vii 


viii 


LIST  OF  TABLES 


LIST  OF  TABLES  (CONITNUED) 


FACE 

TABU 

Normal  Distribution,  o Known;  The  Function  11*  \u) 

44 

13 

Normal  Distribution,  o Known;  Asymptotically 

Optioal  Spacinga  and  Coefficients  for  Seven  or 

Nine  Order  Statistics  

46 

14 

Normal  Distribution,  o Known;  A Comparison  of 

Optimal  snd  Asymptotically  Optimal  Spaclngs  and 

Their  Corresponding  Efficiencies 

47 

IS 

Normal  Distribution,  p Known;  The  Function  H*  ^(u) 

48 

Nonul  Dl.trlbutlon,  , Known;  Asymptotldly  OpciB.1 
s,.clne.,  CocffUlcntn  .nd  Ettlctcncle.  for  Sev«n 
or  Nine  Order  Statiscics 

50 

16 

Normal  Distribution,  p Knotm;  A Comparison  of 

17 

Optimal  and  Asymptotically  Optimal  Spaclngs  and 

Their  Corresponding  Efficiencies  

51 

16 

Normal  Distribution,  Both  p snd  0 Unknown;  The 
Function  H*-*(u) 

S2 

19 

Normal  Distribution,  Both  p and  0 Unknown;  Asymp- 

totlcally  Optioal  Spaclngs,  Coefficients  and 
Efficiencies  for  Seven  or  Nine  Order  Statistics  . . 

55 

20 

Exponential  Distribution,  p Known;  A Comparison  of 

Optimal  and  Asymptotically  Optimal  Spaclngs  and  Their 

21 

Corresponding  Efficiencies  . e ' e • . . 

58 

Exponential  Distribution,  p Known;  Asymptotically 

22 

Optimal  Spaclngs,  Coefficients  and  Correction  Factors 

for  Seven  or  Nine  Order  Statistics  

59 

Pareto  Distribution,  v ■ .5,  p Known;  A Comparison 
of  Optimal  snd  Asymptotically  Optimal  Spaclngs 
and  Their  Corresponding  Efficiencies  

63 

23 

Psreto  Distribution,  v " 2,  p Known;  A Comparison 
of  Optimal  and  Asvmpcotically  Optimal  Spaclngs  and 

64 

24 

PAGE 


Pareto  Distribution,  v • 3,  ii  Known;  A CoMparlaon 
of  OptiMl  and  Asymptotically  Optlaal  Spaclngs  and 
Their  Corresponding  Efficiencies  

I 

Pareto  Distribution,  v ■ .5,  u Known;  Aayinpcotically 


Optimal  Spaclngs,  Coefficients  and  Correction  Factors 
for  Seven  or  Nine  Order  Statistics 66 

Pareto  Distribution,  v • 2 , p Known;  Asymptotically 
Optimal  Spacinga,  Coefficients  and  Correction  Factors 
for  Seven  or  Nine  Order  Statistics 67 

Fef*to  Distribution,  v ■ 3,  p known;  Asymptotically 
Optimal  Spaclngs,  Cocfficlcntu  and  Correction  Factors 
for  Seven  or  Nine  Order  Statistics 68 

Cauchy  Distribution,  o Known;  Tl\e  Function  H*(u)  • . 70 


Cauchy  Distribution,  o Known;  Asymptotically  Optimal 
Spaclngs,  Coefficients,  and  Efficiencies  for  Seven 


or  Nine  Order  Statistics  72 

Logistic  Distribution,  p Known;  The  Function  H*(u)  . 75 

Logistic  Distribution,  p Known;  A Comparison  of  Optlaal 
and  Asymptotically  Optimal  Spaclngs  and  Their  Corres- 
ponding Efficiencies  77 

Logistic  Distribution,  p Known;  Asytsptotically  Optimal 
Spaclngs,  Coefficients  and  Efficiencies  for  Seven  or 
Nine  Order  Statistics . , 76 

Logistic  Distribution,  Both  p and  o Unknown;  The 
Function  H*(u)  79 


Logistic  Distribution,  Both  p and  o Unknown;  A 
Comparison  of  AayRq>totically  Optimal  and  Suboptimal 
Spaclngs  and  Their  Corresponding  Efficiencies  . . . , 62 

Logistic  Distribution,  p and  o Unknown;  Asymptotically 
Optimal  Spacinga  and  Coeffieienta  for  Seven  or  Nine 
Order  Statlttiea  63 


I 


LIST  or  TABLES  (CONTINUED) 


TABLE  PACE 


2S  Vcibull  Distribution,  y " V Known;  Asynp** 
totlcally  (^elMl  Spaclngs,  Coeffleiontj, 

Correction  Factors,  and  Efficiencies  for  Seven 
or  Nine  Order  Statistics 8S 


26  Welbull  Distribution,  y • 2,  m Known;  Asymp- 
totically Optimal  Spaclngs,  Coefficients, 

Correction  Factors,  and  Efficiencies  for  Seven 

or  Nine  Order  Statistics 

27  Ueibull  Distribution,  y * 4,  v Known;  Asymp- 
totically Optimal  Spaclngs,  Coefficients, 

Correction  Factors,  and  Efficiencies  for  Seven 

or  Nine  Order  Statistics ft? 

28  Extreme  Value  Distribution,  o Known;  A Comparison 
of  Optimal  and  Asymptotically  Optimal  Spaclngs 

and  Their  Corresponding  Efficiencies  • > ftO 


29  Extrcoic  Value  Distribution,  o Known;  Asymp- 

totically Optimal  Spaclngs,  Coefficients,  and 
Correction  Factors  for  Seven  or  Nine  Order 
Statistics 91 

30  Lognormal  Distribution,  m Known;  Asymptotically 
Optimal  Spaclngs,  Coefficients,  Correction  Factors 
and  Efficiencies  for  Seven  or  Nine  Order  Statistics 


31  Order  Statistic  Selection  for  Location  Parameter 

Estivation  by  Seven  Order  Stat'lstlcs 101 

32  Order  Statistic  Selection  for  Scale  Parameter 

Estimation  by  Seven  Order  Statistics  102 

33  Order  Statistic  Selection  for  Location  Parameter 

Estimation  by  Nine  Order  Statistics 104 

34  Order  Statistic  Selection  for  Seale  Parameter 

Estimation  by  Nine  Order  Statistics 105 

35  Coefficients,  Correction  Factors  and  Efflcienciea 
for  the  Summary  Rule  Spaclngs  for  Seven  or  Nine 

Order  Statistics 106 


UST  OF  FIGURES 


FIGURE  FACE 

1 Normal  Dlstrlbuclon,  o Known;  The  Function 

*3 

2 Normal  Distribution,  p Known;  The  Function 

49 

3 Normal  Dlatributlon,  p and  o Unknown;  Tha 

FunctlMi  (u) 53 

4 Exponential  Distribution,  p Known;  The  Function 

H*-^u) 57 

5 Pareto  Distribution,  p Known,  v • .5, 1,2, 3 ; 

The  Function  (u)  61 

6 Cauchy  Dlatributlon,  o Known;  The  Function 

H*-»(u) 71 

7 Legiscie  Distribution,  p Known;  The  Function 

H*-l(u) 76 

8 Logistic  Distribution,  Both  p and  c Unknown; 

The  Function  H***^u)  . . 61 

9 Extreme  Value  Distribution,  o Known;  The  Function 

89 

10  Caama  Distribution,  p Known;  The  Function 

H*-*(u) 93 

11  The  H*  ^ Functions  for  Various  Distributions  in 

the  Case  that  o Is  Known  96 

12  The  H*  ^ Functions  for  Vsrloua  Distributions  in 

the  Case  that  p Is  Known  97 


1.  liiTRODUCTIOM 
1.1  Preliminaries 

A frequently  occurlng  statlecicel  model  Is  the  location  and 
Kale  parameter  model.  In  this  model,  it  is  assumed  that  the 
:umulatlve  distribution  function  (c.d.f.)  of  independent  identically 
llstrlbuted  random  variables*  X,,  X.*  ...»  X , is  of  the  form 

dkere  is  a knovn  distributional  form  and  u and  o are  respectively 
inknown  location  and  scale  patameteTs.  Tbe  maalmum  ll>(ellhood 
?stlmators  of  n and  o are  often  difficult  to  compute.  Thus  prac- 
:ical  considerations  often  dictate  the  usage  of  estimators  that  are 
Inefficient  vhen  compared  to  the  Cramer-Rao  lower  variance  bound  for 
inblased  parameter  estimation. 

A class  of  estimators  of  U and  o that  have  good  properties  are 
;hose  formed  as  linear  functions  of  the  sample  order  statistics* 

(the  random  sample  X^*  X^ 

irranged  in  increasing  order).  These  estimators  have  been  called 

•ystenatlc  statistics  by  Hosteller  (1946).  Systematic  statistics 

k 

#111  be  of  the  form  I b^X.  . for  X,  v,  X, 

1-1  ^ ^"2 

y(  the  n sample  order  statlsclce.  For  this  reason  the  problem  of 


Utetions  follow  the  format  of  the  Journal  of  the  American  Statie~ 
:icsl  'Aseoclation. 


y ... 


* \ * subset 

(nk) 


estimation  by  systematic  stotlstlcs  consists  of  two  parts;  (a)  the 
selection  of  a set  of  k order  statistics  and  (b)  the  determination 
of  the  coefficients  for  the  order  statistics  selected. 

P^^^^ftipn  1.1.1,  The  quantile  function*  Q*  corresponding  to  a 
distribution  function  T is  dafined  to  be 

Q(u)  ■ P"'(u)  - lnf(x:P(x)  >_  u).  (1.1.2) 

The  p*^**  quantile  (0  < p < 1)  of  the  distribution  is  Q(p). 

Definition  1,1,2.  Define  the  sample  quantile  function*  Q*  by 

“ i O . 1-1.  n . (1.1.3) 

The  p^^  sample  quantile  (0  < p < 1)  is  Q(p). 

It  is  often  more  convenient  to  consider  linear  systematic 

statistlca  aa  being  linear  functions  of  sample  quantiles.  By  letting 
^ k . 

» - B In  , It  follow,  chat  I b X,  - I b.Qfu,)  uid  hone,  the 
1-1  ^ '"i'  1-1.  * 

two  formulations  are  equivalent. 

A basic  result  which  leads  to  the  usefulnees  of  systematic 

statistics  is  the  following  theorem  due  to  Hosteller  (1946). 

Theorem  1.1.1.  Let  F be  an  absolutely  continuous  distribution 
with  probsbillty  density  function  (p.d.f.)  denoti*-!  by  f.  Let 
0<Uj<U2<  *..<U|^<lbek  real  numbers  and  Q(U2)*  QCu^)*  *..* 
Q(u^)  the  cocreeponding  (population)  quantiles.  Further  sssume 


that  f la  differentiable  In  the  nelBliborhood  of  and 

that  f(Q(u^))  5 fQ(Uj)  ^ 0 for  1 - 1 k.  Tl.cn  the  joint 

dlttrlbutlon  of  the  k aaxq>lc  quantllea,  Q<Uj),  ^(uj),  ....  QCu^^). 
tenda  to  a k-variata  norwil  dlatrlhutlon  aa  n tenda  to  Infinity 
with 


AE(Q(Uj))  - q(uj) 

u,(l  - u.) 


1 "l 

ACOV(Q(Uj),(l(Uj))  - 


n fq(Uj)fq(Uj) 


■ “l  i “j  • 


(1.1.*) 


(l.l.S) 


where  AK  and  ACOV  denote  aayaptotlc  expectation  and  covariance 
reapeccivcly. 

When  th«  location  and  scala  paranetcr  aodal  (1.1.1)  holds  the 
p.d.f.  and  quantile  function  have  the  foraa 


f<x) 


i f f- 

O o| 

Q<u)  • u 


(1.1.6) 


where  f end  Q are  respectively  the  p.d.f.  and  the  quantile  function 
0 o 

corre.pondln*  to  the  known  c.d.f.  A corollary  to  Theotca  1.1.1 

concerning  thin  aodel  follown  Inaedlately  uaing  (1.1.6). 

Corollary  1.1.1.  In  addition  to  the  hypotheaea  of  Theorem  1.1.1, 
aaaunc  that  f and  q arc  of  the  form  (1.1.6).  Then  the  limiting 


dlatrlhutlon  of  the  k aamplo  guantllea  la  k-varlatc  normal  with 


AE(ii(Uj))  - p + 00,(0^) 


AC0V(q(u,),q(u,)) 


.1  -^(1  - -j) 


(1.1.7) 


< u.  . (1.1.6) 


For  the  purposes  of  location  and  scale  parsMter  estlMtiong 
Corollary  1.1.1  may  be  Interpreted  as  stating  that,  asymptotically, 
the  sample  quantiles,  Q(Uj>,  Qfuj) Q(Uj^),  satisfy  the  con- 

ditions required  lor  application  of  the  Causs-Markov  Theorem.  Ihus 
asymptotically  best  linear  unbiased  estimators  (ABUJE's)  of  u and/or 
0 may  be  obtained  through  generalised  least  squares.  Ogawa  (19S1) 
has  given  general  formulae  for  these  estimators  and  their  asymptotic 
relative  efficiencies  (ARE*s)  when  either  one  or  both  of  the 
parameters  are  unknown. 

“n  - Vl  * ‘ 'o’.'".’  • 


kH  if,q„(u,)  - 

Ki-  t 


(l.l.S) 


1-1 


k.l  tq.(u.)fAt".)  - <ln(“l-l»n®n<“l-l>’'  _ ^ 


K,  - t 

* 1-1  “1  ■ "l-l 


10) 


k.l  lf.q„(u,)  - 7A<Vl>i'®n<"l>^®,<V  - 


*1  * * - - - 

^ l-l  "1  1-1 


(1.1. 11) 


A - K^Kj  - , 


2m  I ""  ■ ' ' • 

1-1  “i  ■ “i-l 

(1.1.13) 


k+1  , 

<1 

In  thin  notation  the  AktUE'n  and  AXE'a  darived  by  Ogawa  say  ba 
written  an  followa: 

1.  Aaeuae  a ia  known.  Than  tha  AhLUE  for  \i  ia 


- 1 *3 

M - E - 0 J 


with  aaynptotic  rolativa  afiiciancy 


2.  A«>\Ae  w is  known.  Then  the  ABLUE  for  o is 


* 1 . *3 

o - If-  T -ur- 


with  asynptotic  relative  efficiency 


grfxfiesi]' 

ll  t(X) 


(*  denoting  approximate  equality  for  large  values  of  o). 
3.  Aasiae  both  y and  o are  unknown.  ABLUE*s  for  y and  o are 


; - j (KjE  - E3T) 


; - j <.\y  - KjZ) 


with  asymptotic  relative  efficiency 


ARE(y,o)  - * 


eriHi'W)']-')- 


E X ^ 
i L lf(X)J 


Examination  of  equations  (1.1.15)  * (1.1.21)  reveals  that  the 
estimators  and  their  asymptotic  relative  efficiencies  are  all 
functions  of  the  epaclncs.  U2»  ...|  u^.  Therefore,  through 
strategic  placement  of  the  spacings  It  Is  possible  to  further  opti- 
mise asymptotic  estimator  efficiency.  A set  of  spacings  resulting 
In  a maximum  value  for  one  of  the  efficiency  expressions  (1.1.16), 
(1.1.18)  or  (1.1.21)  will  be  termed  an  optimal  spacings  set  while 
the  problem  of  finding  such  sets  will  be  termed  the  optimal  snaclnfis 


(1.1.17) 


7 


0 


il.i-W 


(1.2.25) 


(1.2.26) 


and  Boxlaize  one  of  the  expressions  for  asymptotic  estloiotor 
efficiency  given  In  Section  1.1.  The  next  step  Is  to  find  the  optimal 
<;>Aclngft  Chat  correspond  to  these  solutions.  Much  of  the  literature 
on  the  optlowil  spacings  problea  Is  concerned  with  solutions  obtained 
in  thia  Planner. 


For  certain  distribution  typsSg  the  ssynptotlc  rclsclve 
efficiency  expressions  becone  ^ulte  coaplleaced.  Consequently 
numerical  methods  have  frequently  been  employed  to  find  spsclngs 
sets  resulting  In  optimal  or  near  optimal  ef flclenclea.  The  reaults 
obtained  are  usually  expressed  In  the  form  of  tables  of  optimal 
spaclngs  and  Che  corresponding  coefficients  for  the  ABUJE's  for 
various  values  of  k. 

1.2.2  Normal  Distribution 

Ogswa  (1951)  has  considered  the  optibal  spaclngs  problem  for  the 
normal  distribution.  Zn  the  event  that  o is  knovn,  it  was  shown 
that  there  exists  a unique  aet  of  optimal  spaclngs  for  each  value  of 
k.  These  optimal  spaclngs  were  shown  to  be  symmetric,  l.c. 

-f  Numerical  techniques  were  employed  to  find  the 

optimal  spaclngs  when  k 2(1)10. 

For  the  case  of  a known  Iocs  Ion  parameter,  it  was  found  that 
the  function  ARE(o)  had  many  maxima.  Although  the  greatest  maximum 
was  not  found,  of  the  spacing  sets  considered  those  resulting  in  the 
largest  efficiency  values  were  reported  for  k • 1(1)6. 

Simultaneous  estimation  of  both  the  location  and  scale  parameter 
was  considered  only  for  an  eatimatur  based  on  two  order  statistics. 
This  estimator  was  derived  under  Che  assumpclon  of  sytu&ecrlc  spaclngs. 

Elsenberger  and  Posner  (1965)  have  extended  Ogawa's  results. 
Assuming  o or  u to  be  known, optimal  spaclngs  have  been  calculated 
for  the  cases  k 2(2)20.  Suboptlmal  epaelnge  that  minimise  the 
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6un  of  the  estimators' variances,  V(ij)  V(o),  were  also  found  for 
k - 2(2)20. 

1.2.3.  Exponential  Distribution 

Sarhan  and  Greenberg  (1958)  have  considered  the  estltutlon  of 
the  scale  parameter  of  the  exponential  distribution  under  the 
assumption  that  the  location  parameter  was  known.  Optimal  spaciugs 
were  obtained  for  linear  systematic  estimators  based  on  k ■ 1(1)15 
order  statistics. 

1.2.4  Pareto  Distribution 

For  the  Pareto  distribution  the  location  and  scale  parameter 
model  takes'  the  form 

F<x)  - 1 - (I  + . X > u (1.2.27) 

where  v > 0 Is  a known  shape  parameter.  Kulldorf  and  Vannman  (1973) 
have  considered  the  estimation  of  w and  a by  linear  functions  of 
optimally  spaced  order  statistics. 

For  p assumed  known, optimal  spaclngs  for  the  estimation  of  o 
were  obtained  when  v ■ .S(.5)S  and  k ■ 1(1)10.  Wlien  v ■ 1,  the 
optimal  spacings  were  found  to  be  the  points  - , 1 ■ 1,  ...,  k. 

For  both  u and  o unknown,  It  was  shown  that  optimal  spacings 
sets  exist  for  the  simultaneous  estimation  of  u and  o only  If  the 
sample  is  censored  to  the  left.  Thus  a set  of  modified  estimators 
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was  suggested  which  use  the  optimal  spacings  for  the  estimation  of 
0 ( when  p Is  known)  with  k • 1 sample  quantiles. 

1.2.5  Cauchy  Distribution 

Bloch  (1966)  has  considered  the  estimation  of  the  location 
parameter  of  the  Cauchy  distribution  by  a linear  function  of  five 
order  statistics.  Numerical  techniques  were  utilised  to  obtain  a 
set  of  spacings  that  was  essentially  the  optimum.  These  spacings 
corresponded  to  an  asymptotic  relative  efficiency  of  .95161.  A 
linear  systematic  estimator  based  on  thes'e  five  spacings  was  found 
to  be  superior  to  the  optimum  trlimaed  mean  and  the  sample  median. 

1.2.6  logistic  Distribution 

Gupta  and  Cnanadeslkan  (1966)  have  considered  the  optimal 
spacings  problem  for  the  logistic  distribution.  In  the  case  of  a 
known  scale  parameter,  the  optimal  spacings  for  location  parameter 
estimation  were  found  to  be  the  points  - " , 1 - 1,  ...,  k.  An 

explicit  form  for  the  estimator  based  on  these  optimal  spacings 
was  given  as 

^ (iTTt) 

with 

ARE(i)  - ■ (1.2.29) 
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Optimal  spacin^s  for  the  eRtlouitlon  of  o,ln  the  event  that  p Is 
knovn.vcrc  obtained  for  two  or  three  symmetrically  spaced  order 
statistics. 

Hassanein  (1969b)  has  considered  suboptimal  simultaneous 
location  and  scale  parosntcr  estimation  using  spacings  that  minimize 
the  sum  of  the  variances  of  the  cstlnatoro.  Hie  spacings  for  these 
suboptimal  estimators  were  given  for  the  cases  k * 2(1)9. 

1.2.7  Ueibull  Distribution 

The  location  and  scale  parameter  model  for  the  Weibull  distri- 
bution is  of  the  form 

P(x)  . 1 - exp  » > K . <1.2.30) 

where  y is  a known  positive  shape  parameter.  Hassanein  (1971)  has 
obtained  optimal  spacings  for  the  simultaneous  estimation  of  both 
the  location  and  scale  parameters  of  this  distribution.  The  optimal 
spacings  are  given  for  estimators  based  on  two,  four,  or  sis  order 
statistics  for  the  cases  y ■ 3(1)10(5)20., 

1.2.8  Extreme  Value  Distribution 

Hassanein  (1968)  has  studied  the  estimation  of  location  and 
scale  parameters  in  the  extreme  value  distribution.  Optimum 
spacings  for  the  estimation  of  the  location  parameter  when  the  scale 
parameter  is  known  were  given  for  the  cases  k - 1(1)15.  Under  the 


assumption  chat  Che  location  parameter  is  known,  optimum  spacings 
for  scale  parameter  estimation  were  obtained  for  k • 1(1)4.  An 
Iterative  scheme  was  proposed  for  simultaneous  estlmstion  of  p 
and  a by  linear  functions  of  two  order  statistics. 

The  problem  of  selecting  optimal  spacings  for  simultaneous 
location  and  scale  parameter  estimation  has  been  considered  by 
Hassanein  (1969a,  1972).  Spacings  that  minimise  Che  sum  of  the 
variances  of  the  estimators  have  been  obtained  tor  k • 2,  3,  4 
and  optimal  spacings  have  been  obtained  for  k • 1(1)10. 

1.2,9  Gamma  Distribution 

The  location  and  scale  psraaeter  model  for  the  gamma  distri- 
bution results  In  a p.d.f.  of  the  form 


where  p > 0 is  assumed  known.  Sarndal  (1964)  has  obtained  nearly 
optimum  spacings  (In  the  sense  that  they  result  in  near  maximum 
are's)  for  the  estimation  of  the  scale  parameter  of  the  gansaa 
distribution,  when  p is  assumed  known  for  k • 1(1)10  with  p ■ 2(1)5. 
The  techniques  utilised  in  the  calculation  of  these  spacings  are  a 
special  case  of  a general  approach  to  optimal  spacings  selection 
developed  by  Samdal  (1962). 

Hassanein  (1977)  has  obtained  suboptimal  spacings  for  simul- 
taneous location  and  scale  parameter  estimation  that  maximize  tbe 
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8uts  of  the  efficiencies  of  the  estimators.  Spaclngs  sets  were 
given  for  cstlnutors  basoJ  on  k * 2(1)5  order  statistics  with 
p - 3(1)10(5)20. 

1.3  Objectives 

The  purpose  of  this  research  is  threefold:  (a)  to  formulate 
the  problems  of  location  and  scale  parameter  estimation  and  spacing 
selection  in  a unified  framework,  based  on  regression  analysis  of 
the  continucus  paraiueter  sa"‘plc  quantile  function  Q(u),  0 n ^ 1, 
thereby  developing  a general  computationally  simple  solution  to 
the  optimal  spacings  problem,  (b)  to  apply  this  technique  to  several 
common  distributions,  and  (c)  to  develop  guidelines  for  the  selection 
of  order  statistic  subsets  for  the  summarization  of  large  data  seta. 
The  problems  of  optimal  order  statistic  selection  for  estimation 
in  censored  samples  and  quantile  estimation  will  also  be  considered. 


T 
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2.  SPACINCS  FOR  UNCENSORED  SAMPLES 


2.1  Preliminaries 


As  seen  in  Chapter  1,  the  classical  approach  to  optimal  location 
and/or  scale  parameter  estimation  by  linear  functions  of  order 
statistics  has  centered  upon  the  efficiency  of  the  estimators.  The 
asymptotic  relative  efficiencies  of  these  linear  systematic  statistics 
arc  functions  of  the  spacings  of  the  sample  quantiles  included  in 
the  estimators.  Ttius  spacings  that  maximize  expressions  for  asymp- 
totic efficiency  correreaAd  to  a best  set  of  sample  quantiles  to  be 
used  for  estimation  purposes.  Therefore,  classically,  optimal 
estimators  were  obtained  by  first  finding  spacings  that  resulted  in 
maximum  values  for  asymptotic  efficiencies  and  then  using  the 
corresponding  sample  quantiles  to  estimate  location  and  scale  para- 
meters by  generalized  least  squares. 

There  are  several  problems  associated  with  the  classical 
approach.  Two  such  difficulties  arc: 

1.  Finding  spacings  that  result  in  maximum  values  for 

the  asymptotic  relative  efficiencies  is  often  computationally 
quite  'ifflcult. 

2.  There  is  no  unified  franicwork  for  solving  the  problem  of 
optimal  estimation. 

In  this  chapter,  a new  approach  to  optimal  location  and  scale 
parameter  eatlsutlon  la  presented  which  alleviates  many  of  the 


problems  Inherent  In  the  classical  acthod.  It  will  be  seen  In  this 
snd  subBe<iucnC  chapters  that  the  coaputatlonal  aspects  of  this  new 
procedure  are  quite  slaplc.  Also  by  using  this  approach,  the 
problem  of  obtaining  optimal  linear  systematic  estimators  of  the 
location  and/or  scale  paraaetera  may  be  foraulated  In  a unified 
regression  framework. 

Tlie  principal  results  of  this  chapter  are  contained  in  Section 
2.5.  Sections  2.2  through  2.6  provide  the  necessary  background 
for  the  development  of  these  results.  Section  2.2  contains  a few 
preliminary  concepts  and  definitions.  Section  2.3  treats  the  topic 
of  regression  design  for  a Brownian  Bridge  process.  In  Section  2. A, 
it  is  seen  that  the  problem  of  location  and  scale  parameter  estimation 
can  be  forwilatcd  as  one  of  continuous  parameter  time  aeries  regres" 
slon.  The  results  of  Section  2.3  and  2. A are  combined  In  Section  2.5 
to  obtain  estimators  of  u and/or  o based  on  asymptotically  optimal 
spaclngs.  Finally  In  Section  2.6,  the  approach  taken  here  Is 
contrasted  with  those  of  Chernoff  (1971)  and  Sarndal  (1962). 


2.2  Definitions  and  Notation 


properties  for  every  u in  T (where  IC(*.u)  is  the  function  defined 
on  T whose  value  at  s in  T is  K(s,u)): 


K(*.u)  c H 


(g.K(*.u))  - g(u) 


for  every  g in  H. 


A kernel  that  will  be  seen  to  be  of  particular  Interest  is  the 
covariance  kernel  of  a Brownian  Bridge  process. 

Definition  2.2.2.  A Brownian  Bridge  process  {B(u),  u c (0,1])  is 
a aero  mesj)  normal  process  vith  covsrSsace  kernel 

*,(Uj,Uj)  - ■ln,(u^.Uj)  - UjUj  . (2.2.3) 

The  Hilbert  function  space,  generated  by  consists  of 
differentiable  function,  eatl.fylng  f(0)  - f(l)  - 0 for  every  f In 
Hj.  The  Inner  product  of  two  function  f end  g In  H,  U 


This  section  contains  definitions  and  notation  that  will  be 
used  In  subsequent  sections. 


• / f*(u)g'(u)du  . 
0 


Definition  2.2.1.  A reproducing  kernel  Hilbert  space  (RKHS) , with 
reproducing  kernel  K,  is  a Hilbert  space,  H,  of  functions  defined 
on  s set  T.  The  kernel  K Is  a function  on  T » T with  the  following 


By  taking  g(u)  • f(u),  It  is  seen  that  for  any  f In  H 


{f(u))2du  . 

0 
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2.3  Regression  Design  for  a Drownlnn  Bridge  ProccsB 


Let  (B(u),  u c [0»1])  be  the  Brownian  Bridge  process  defined 
in  Section  2.2.  Consider  the  regression  nodcl 


Y(u)  - E 8.f.<u)  * aB(u)  , u c (O.ll  > 
i-1  ^ ^ 

COV(Y(s),  Y(t))  - 


(2.3.1) 


where  f2*  ...»  *te  given  regression  functions*  8^*  ^2*  '*** 
a ere  b 1 unknown  parsaeters  end  is  given  by  (2.2.3).  For 
this  Bodcl  an  infinite  observation  set  is  feasible,  in  which  case 
paraaeter  estiaacion  say  be  accospllshed  using  techniques  developed 
by  Paracn  (1961a, b).  It  is  often  note  convenient  to  take  obser- 
vations at  only  a finite  nuaber,  k,  of  points  on  10,1]  as  then 
paraaeter  estlMtes  say  be  obtained  by  generalised  least  squares. 
Since  the  point  set  to  be  selected  Is  at  the  disposal  of  the 
experi&entcr.  It  should  be  selected  in  such  a manner  that  the 
resultant  cstloators  have  optimal  properties  over  all  such  estimators 
formed  from  the  same  nuaber  of  observations. 

The  problem  of  selecting  a "best**  set  of  points  (for  estimation 
purposes)  at  which  to  sample  from  the  process  {Y(u),  u c [0,1]) 
is  one  of  design  for  continuous  paraaeter  time  series  regression. 

The  point  secs  are  called  designs  and  the  points  themselves  are 
called  design  points.  Since  sampling  over  time  obviates  the 


repetition  of  observations  at  a particular  design  point,  it  is 
necessary  to  define  what  is  meant  by  a k-point  design. 


Definition  2.3.1.  A k-polnt  design  for  a Brownian  Bridge  process 

(and  hence  for  {Y(u),  u e [0,1]))  ia  a k-tuplc,  {u^,  u^ , 

with  0 < u^  < Uj  < ...  < Uj^  < 1.  Denote  by  the  set  of  all  such 
k point  designs. 

For  T c Dj^,  let  denote  the  BLUE  of  8 • (8^,  Bj,  ...,  8^)' 
based  on  observations  taken  according  to  T.  Let  8 denote  the 
estimator  of  8 obtained  using  observations  over  all  of  [0,1]. 

Optimal  designs  are  those  that  minimise  Che  variance  of  the 
estimator  in  the  case  m ■ 1 or  minimise -the  generalised  variance  of 
the  estimators  in  Che  case  m >^2.  As  a result  of  the  order 
restrictions  satisfied  by  design  points,  is  not  a compact  set. 
Consequently,  optimum  designs  frequently  do  not  exist  and  are  usually 
difficult  to  construct.  This  leads  to  the  consideration  of  design 
sequences  that  are  asymptotically  optimal. 


Definition  2.3,2.  For  the  case  b“1  a design  sct^uencc  l*  ^k  ^ ^k* 

la  asymptotically  optimal  for  estimating  8^  £ 8 if 


lim 

k-** 


V(B_  ) - V(B) 

*k 

inf  V(B.)  - V(6) 

■T^n  * 


1 


(2.3.2) 


Definition  2.3.3.  For  the  case  m ^ 2 a design  sequence  . 

c D^  is  asymptotically  optimal  for  estimating  8 If 


i 


Then 


lim  < 

I "‘X’"' 

- |V(S)-'| 

llnf  lv($  )‘^ 
^TcD. 

1-  |V(B)"‘| 

(2.3.3) 


Design  se<iuences  aay  be  constructed  through  the  use  of  density 
functions.  Let  h be  s continuous  non'^negstive  density  function  on 
10, I]  with  tssoclsted  distribution  function 


u 

H(u)  - / h(t)dt  . (2.3.4) 

0 

The  density,  h,  generates  a design  sequence  where 

and  Tj^  - {h'*(  ),  h'*(  ) " 

asynptotlfally  optimal  design  sequence  will  be  seen  to  be  equivalent 
to  finding  ai>  opt Imal  density,  l.e.,  finding  a density  that  generates 
an  asymptotically  optimal  density  sequence. 

The  next  theorem  gives  densities  that  generate  asymptotically 
optimal  design  sequences.  Its  proof  can  be  found  as  a straightforward 
application  of  results  obtained  by  Sacks  and  Ylvisaker  (1966,  1968). 

t 

Tlicorcm  2.3.1,  Let  be  twice  contlnuou--ly  differentiable  on  10,11 
and  have  the  representation 

ft(u)  • - fj’'(  )K,(u,t)dt  , 1-1 • . (2.3.5) 


For  m ■ 1 the  density 

Il,"(u)l^ 

(2.3.6) 

/ If,"(t)J^dt 

0.  * 

generates  asymptotically  optimal  design  sequences  for 

..tlwtlnt  1 5. 

For  • > 2 1.C 

F(i.)  - -(fj"(u),  fj”(u) £,"(.))• 

(2.3.7) 

A - , 1.  3 - 1 ■ . 

(2.3.8) 

The  density 

(♦•(u)A*',(u)l'/3 

h»(u)  (2.3.») 

/ l♦'(t)A■',(t)l'^dt 

0 

generates  asymptotically  optimal  deaigna  for  estimating 

8 - (8j,  Sj BJ'. 

Remark  2.3.1.  Let  T * (u^,  U2 c and  m * 1.  Denote  by 

P^f^  the  projection  of  f^  onto  the  linear  manifold  generated  by  the 
functlone  K^(*,u^),  1 • 1,  ...,  k . It  can  be  shown  that 


v(Bt)  -iivjr  - r lUif -iifj  - 


Finding  a sequence  ssclsfying  (2.3.2)  is  roughly  (flp;irt  from  conver- 
gence rate  considerations)  the  ssse  as  finding  a sequence  • 

such  that 

11.  I Ilf,-?  f !i’-l„£||f  -p  f 1|2)  . 0 . (2-3-11) 

Ir-M  * *1,  * ip-n  A LX 


Reaark  2.3.2.  Let  h be  a density,  and  suppose  design 

sequence  generated  by  h.  When  a ■ 1, Sacks  and  Ylvlsaker  (1968) 
have  shown  that 


11*1  ' ’’i.  hll  ’ ■ 12  ^ 


hCu)  J 


(2.3.12) 


provided  h and  satisfy  any  of  the  following  conditions: 

j -2 

1.  I (h(u))  du  < • and  f."  la  continuous  on 

a * 


2.  —r—  is  continuous  on  [0,1]  . 

h 


3.  There  exist  a constant  C such  that 
b b 

(b  - a)  / h’(u)<lu  < C(/  h(u)dul’  for  >11  I>,b]  cr  (0,1)  . 


The  authors  have  also  shown  that 


llak*  Inf  Ilf,  - [/Nf,'’(u))^3^ul5  . (2.3.13) 

k^  TcD..  ^ ^ ^ 0 ^ 
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Tlius  In  view  of  Reaark  2.3.1  and  equation  (2.3.12)  the  task  of  finding 
an  asyaptotically  optiaal  design  sequence  can  be  acccnpllshed  by 
finding  a density  function  that  attains  the  bound  (2.3.13).  Such 
a density  is  (2.3.6). 

Reaark  2.3.3.  A aultl-paraaeter  version  of  Remark  2.3.2  aay  also 
be  aade.  Sacks  and  Ylvlsaker  (1968)  have  shown  that  for  (a^,  .... 
a^)  a sec  of  positive  numbers 

lla  Inf  k*  I >,  ||f,-P_  t >,(-f  ''(uljM'^dn)’  (2.3.14) 

k-«»  1-1  k 0 1-1 

for  any  sequence  of  designs  • Finding  an  asyaptotically 

optiMl  design  is  equivalent  to  finding  a density  h*  that  attains 
a lower  bound  of  the  fora  (2.3.14).  Such  an  optimal  density  is 
given  by  (2.3.9). 

Reaark  2.3.4.  Ihe  design  sequences  defined  In  Theorea  2.3.1  differ 
froa  chose  suggested  by  direct  application  of  Che  formulae  of 
Sacks  and  Ylvlsaker  (1966,  1968).  These  results  may  be  reconciled 
by  noting  that  no  Inforaatlon  is  obtained  froa  observations  taken 
at  0 or  1 for  regression  functions  In  the  RIQIS  generated  by  K^. 

This  is  because  (as  noted  in  Section  2)  such  functions  are  neces- 
sarily zero  at  these  points.  By  taking  the  (k  f 2)^^  elcncnt  of 
the  sequences  suggested  by  these  authors  and  disregarding  design 
points  at  0 and  1 Theorem  2.3.1  aay  be  obtained. 
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Rmark  2.3.5*  To  obtain  designs  for  an  Interval  Ip.ql  and 
regression  functions  of  the  fom 

q 

»l(“)  • -/  + CjKj(u,p)  + CjIt^Cu.q),  1-1 ■ , 

9 

(2.3.15) 

a Bodlfieatlon  of  (2.3.6)  and  (2.3.9)  la  required.  Although  the 
for«  of  Che  optlsMi  densities  are  not  altered, all  lialts  of 
integration  nust  be  changed  froa  0 and  1 to  p and  q.  The  design 
points  for  the  (k  * 1)*^  eleaent  In  the  sequence  are  then 
).  1 - 0 k . 

2.4  Location  and  Scale  Paraaeter  Estimation  aa 
a Continuous  Paraaeter  Time  Series  Regression  Problea 

Parsen  (1979)  has  phrased  the  problea  of  linear  estlaatlon  of 
location  and  scale  paraaeters  as  a problem  in  regression  analysis 
of  a smoothed  sample  quantile  process  (f^Q^(u)Q(u) , u c [0,1]). 

The  formulation  rests  upon  s theorca  of  Csorgo  and  Revest  (1978) 
regarding  the  deviation  of  Q from  the  true  quantile  function  Q. 

This  theorem  may  be  paraphrased  as  saying  that,  under  suitable 
conditions  on  fQ,  ^ fQ(u)lQ(u)  - Q(u)],  0 < u < 1,  is  asymptotically 
a Brovnlao  Bridge  process. 

For  the  location  and  scale  parameter  model 


the  true  quantile  function  la 

Q(u)  - u + oQ^(u)  (2.4.2) 


where  Q Is  the  quantile  function  corresponding  to  F . n»ua  the 
o o 

process 

*,Q^(<i)Ig(u)  - 11  - og^Cu)]  . u c (0,11)  <2.*.J) 

aay  be  considered  as  a Brownian  Bridge  process,  (B(u),  u e 10,1}), 
for  large  values  of  n. 

Farsen  (1979)  has  justified  writing  the  expression 

^ { 0 (u)(g(u)  - u - og„(u)i  - »(u)  (2.*.*) 

0 0 0 o 


which  holds  asymptotically  as  n *»  •«  The  estimation  of  u end  o 
la  then  seen  to  be  a problem  In  continuous  parameter  time  series 
regression  by  writing  (2.4.4)  as 


(2-A-5) 

00  oo  ooo  a 


where 


Oj  • o/'/n  , 


(2.4.6) 


The  parameter  Is  not  constrained  to  be  related  to  o and  la  estimated 
aa  a free  parameter.  Therefore  its  estimate  provides  a diagnostic 
cltecH  on  the  goodness  of  fit  of  the  model. 


i 
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A. 


2.5  Selection  of  Optimal  Spoclnfis  as  a Regression  Design 
Problem  for  the  Quantilp  Process 

The  regression  model  for  location  and  scale  parameter  estimation 
was  seen  In  Section  2.4  to  be 


<2.S.2) 


and  {»(u),  u c 10,1))  1.  . Broxnl.n  Irldg.  prow*.  Thl.  aod.! 

1.  seen  to  be  a apcclal  case  of  the  aodcl  (2.3.1)  by  Baking  the 
Identiflcatlona  fj(u)  - fj<“>  ‘ *1  ‘ 

■ d and  o • Oj.  Therefore,  in  an  analogoua  manner  to  Section  2,3 
optimal  regreeaion  dealgns  may  be  conaldered  for  what,  in  thla  case, 
would  be  location  and  scale  parameter  eatlmatlon.  However,  a 
comparison  of  the  definition  of  a design  for  a grcwnlan  Bridge  process 
(Definition  2.3.1)  with  the  properties  'of  a act  of  spacinga  shows 
that  such  a design  is  nothing  more  than  a act  of  spacinca  for  sample 
guantiles.  Thus  for  the  model  (2. 5. IX  selecting  an  optimum  design 
for  the  estimation  of  location  and  scale  parametera  la  egulvalent 
to  pcloctlng  nn  optimal  Hrt  of  upncinp.n. 

In  the  light  of  the  previous  dlacuaalon.  It  follows  that 
densities  which  generate  seguencea  of  asymptotically  optimal  spacinga 


sets  (designs)  may  be  obtained  by  an  application  of  Theorem  2.3.1. 
Thla  statement  la  fomallied  In  Che  next  theorem. 

Theorem  2.5.1.  Let  f 0 end  0 ‘f  Q be  twice  continuously  differ- 
o o o o o 

•mclmblm  on  (Ogl)  aod  poMM«  th«  rtpreaencacloaa 

‘o‘>o<'‘'  ■ * /'l‘o‘'o**^'"'‘g<'‘’‘^‘‘*  (2.5.3) 


<)o(“>(,QoW  - - / lQ„<t)f,Q,(t))"IC,(u.t>dt 


(2.5.*) 


The  follovlng  concluatona  hold: 
1.  Tho  deaoity 


h*(u) 


(f  Q "(u))^* 

0 o 

/'(f  Q "(t))\t 
0 ® ® 


(2.5.5) 


generateo  « sequence  of  ssymptotlcslly  optimsl  spsclngs 
sets  for  the  estimstion  of  w when  o Is  known. 


2.  The  density 


h*(u)  • 


([Q^(u)f^Q^(u))”)^3 

/^'(lQ^(t)f^q^<t))”)^>dt 


(2.5.6) 


generates  s sequence  of  asymptotically  optimal  spsclngs 
sets  for  the  estimation  of  o when  u Is  knowoe 
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3.  Ut 


»(u)  - -(If  0 (u)r’.[Q^(u)f  Q (2.5.7) 

o o o o o 


and  dcfioa  th«  Infonation  aatrlx.  A,  by 


oooo  ooooo 
00  00  000  ooo-* 


ITm  denolcy 

h*(u)  - 


(2.5.8) 


(2.5.9) 


button  (unction  U.  Define  for  e epeelfied  nusber  of  epeclng*>  k. 
the  functloiu 


k4>l 

-4«»  • 


" U + ij 

" li  + ij 

. (2.5.10) 


(2.5.11) 


generates  s sequence  of  ssyoptotlcslly  optlasl  spaclngs 
sets  for  the  slnultaneous  estlnation  of  p end  o. 

Reaark  2.5.1.  The  asyaptotic  optimality  of  the  spaclngs  sets  may 
be  interpreted  as  meaning  that  as  the  number  of  spacings  in  a set| 

t 

k,  grovs  large  the  spaclngs  in  Theorem  2.5.1  give  rise  to  estimators 
vith  approximately  the  same  efficiency  as  cstisiators  based  on  the 
optimal  set  of  k spaclngs. 

Given  any  density  function,  in  particular  those  in  Theorem  2. 5. It 
the  form  of  the  estimators  may  be  deduced  from  those  in  Chapter  1. 

It  will  be  useful  to  adopt  a somewhat  different  notation  than  that 
employed  there.  Let  h be  a density  function  with  associated  distri- 


( 

‘ ff  A 

Irh). 

L-if  1 1 . 

fl  - 1]  ‘ "’o 

* U + ij 

k * ij 

- (H^l  > IkTrl  (rhl  > 


A(h)  - Kj^(h)Kj(h)  - K3(b)2  . (2.5.13) 
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Also  define  the  welpht  functions  Wy(l»h),  W^(i,li).  S^CiJ0» 
5g(l,h)  and  correction  factors  C^fh),  Cg(u)  as  follows: 


V^(l.h) 


lt^(h) 


...  /t.“* f 1 + 1 

) - f.-JoCH'* 

1 

k + 1 

■im 

1 

[k  4 1 

J 

Kj(h) 

Kj(h) 


(2.5.15) 


K,(h) 

e„(h>  - — , (2.5.17) 

" Kj(h) 


S (l.h)  --i-  (K  (h)W  (l.h)  - K (h)W  (l.h)J  , (2.5.18) 

“ 8(h)  * " ^ ® 

S_(l.h)  --i-  [K,(h)W  (l.h)  - lC,(h)W  (l.h)J  . (2.5.19) 

® */k\  2 M 3 0 


Estlaators  of  M snd/or  o based  on  secs  of  k aayi^totically 
optisMl  apaclngs  are  given  as  follows: 

I.  Let  h*  be  defined  as  in  (2.5.3).  An  estisacor  for  u 
when  0 la  known  la 

u*  - ^I^W/l.l>*>«CH*‘'|i;-^])  - oC^(h*)  (2.5.20) 


with 

K, (h*) 

«E(v*)  . (2.5.21) 

O O 0 0 

2.  Lot  h*  be  defined  as  In  (2.5.6).  An  estlnator  for  o 
when  u 1*  known  la 


qS  m 


(2.5.16) 


^ - -if  1 1 

I W (i,h*)Q(M*  r-J-r  ) - pC  (h*) 
® I*  ^ ® 


(2.5.22) 


I 


}1 


with 

ARE(o*) ^ . (2.5.23) 

<(  Q ■<!  Q •<)  > 

' 0^0  o © O 0 

3.  Ue  h*  be  defined  es  In  (2.5.9).  Slaultaneous  eBClBators 
of  M end  a ere 
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For  e eyMKctric  dletribuclon  it  con  be  ehown  that  the  off 
diagonal  elementa  of  the  natrlx  A in  (2.5.8)  are  aero  (aee  Paraen  * 

(1979)}.  Thla  forimla  (2.5.9)  nay  be  alapllfied  aoieewhat  in  this  case.  ^ 

Aa  a apccific  exaaplc  of  a ayaactric  distribution,  consider 
the  noraal  dlstributlcm.  The  c.d.f.  and  p.d.f.  of  the  noraal  diatri-^ 
bution  are  often  denoted  by  9 and  4 respectively.  In  this  notation, 
the  functions  that  have  been  considered  so  far  are 


k - 1 ) 

ii*.  IS(i,h*)Q(H*  krvT  > 

1.1 « 


(2.5.24) 


(2.5.25) 


(2.5.26) 


It  »houl6  b.  »ote6  that  th***  foniulae  way  b.  adapt..!  apaclnga 
lenarattd  by  an  arbitrary  danalty,  b,  by'aubatltut  r ..r  h*  In 

aquatlona  (2.5.20)  through  (2.5.26). 

Ihc  romalndcr  of  thla  accclon  will  be  devoted  to  conalderlng 
apeclal  case,  of  the  previous  resulta.  Theae  serve  to  point  out 
certain  aliopUf Icallons  as  well  aa  uosw  aliortconlnga  of  Theorew  2.5.1. 
First  the  case  of  a aymetrlc  distribution  (in  particular  the  norwal) 
will  be  considered.  Secondly,  the  eaponentlal  distribution  will  be 
seen  to  pose  certain  problems  In  the  application  of  Theoren  2.5.1. 


Q (u)  - 6 (u)  , 

c 


(2.5.27) 


f Q_(u)  - (2.)*^  exp  (-‘li|6“(o)|2)  . (2.5.28) 

o o 


It  follows  fron  (2.5.28)  that 


-W'W  -.030 


-I»o<!o'“«of”»"  - ^ ■ 


Q.(u) 


(2.5.29) 


(2.5.30) 


and  A - diag  (1,2). 

The  optlnal  density  for  einultaneous  esclnatlon  of  u and  o is 

(1  + 2|e‘*(u)|2)\xp  {‘/3|»‘'(u)|2) 

h*(u).7-j . (2.5.31) 

/ (1  + 2|6‘'(t)|2)\xp  (Valb'Vu'l’ldt 
0 

The  corresponding  optlnal  c.d.f..  It*,  must  be  tabulated  by  numeric 
integration  (see  Chapter  3).  Tor  a given  k,  the  asynptotlcally 
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optimal  bpacliigSf  11*  ^ • 1 • 1,  .*•,  k*  can  thou  be  found 

by  interpolation. 

For  the  exponential  dlatribution 


The  osyaptotlcally  optimal  apaclnga  generated  by  h*  are 


k . (2.S.3S) 


Q^(u)  • -ln(l  - u)  , (2.5.32) 

f Q (u)  • 1 - u . (2.5.33) 

o o 


The  { Q function  in  thia  caac  cannot  be  represented  as  In  (2.5.3). 

0 o 

In  fact  the  f Q function  la  not  even  In  the  RKllS  of  K_  as 
o o B 

f Q (0)  «<  0.^  Conaeouently*  Tlteorea  2.5.1  does  not  pertain  to  the 
o o 

f Q function  In  (2.5.33).  This  means  that  for  exponential  data* 
o o 

estimation  of  p vhen  o is  known  or  simultaneous  estimation  of  p and 

0 may  not  be  accomplished  using  the  theory  developed  in  this  section. 

However,  the  function  f Q *Q  does  possess  the  desired 
0 0 0 

representation.  Therefore,  It  is  possible  to  obtain  apaclngs  for 
estimating  o when  p is  known  via  equation  2.5.6.  The  optimal  density 

is 


h*(u) 


(u  - 

/'<t  - D'^’dt 

0 


(2.S.34) 


*0llu*r  distributions  (c.g.  the  Pnroto)  nlyo  h.nvo  this  problem. 
In  such  cases,  there  appears  to  be  a correspondence  between  the 
distribution  being  non-regular  and  the  functiona  not  being  a 

member  of  the  RKHS  of  K.^.  Whether  this  holds  in  genaral  la  a topic 
for  further  research. 


2.6  Comparison  with  Other  Approaches 


In  this  section  the  results  of  the  previous  section  are  applied 
to  solve  problems  considered  by  Chemoff  (1971)  and  Samdal  (1962). 

Chemoff  (1971)  considers  the  optimal  apaclngs  problem  for  the 
normal  distribution.  He  assumes  that  the* normalized  quantiles,  s^, 
are  selected  according  to  some  non-negative  density  function  g(z). 
This  may  be  interpreted  as  meaning  that  for  large  k there  should  be 
approximately  kg(z)d  quantiles  in  a small  interval  (s,  t A). 

Using  the  notation  adopted  In  Section  2.5  for  the  normal 
distribution,  the  ARC  of  a linear  aystematie  estimator  of  p ia 
proportional  to 


R 


1 


k+1  U(Zj)  - 


(2.6.1) 


li+l 

K - t 
^ 1-1 


(♦(.j)  - ♦(.i.j)!  (2.6.2) 
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and  note*  th*t  for  large  k 

*1  ■ ■ I ■ 1 • (2.6.3) 


Thu*  1*  • discrete  approxlBAtion  to 

/ i2*(*)d»  , (2.6.4) 


Therefore  the  problea  of  selecLing  optiaal  spacing*  may  be  viewed 
as  one  of  selecting  a best  set  of  points  to  discretely  approxlaate 
(2.6.4). 

By  expanding  6(z)  in  its  Taylor  series  about  the  differences 

between  the  integral  (2.6.4)  and  can  be  shown  to  be  approxlasted 
for  large  k as  follows: 


k+1  , k-fl  ♦(*  ) 

'hi  £ ('i  - ' -h  £ <*1  - 'i-i) 

i-l  ‘ ‘ ^ ‘ ^ 12k2  1-1  g2(x  ) ‘ ‘ ‘ 


: _i_  j‘  -Mil  i,  (2.6.5) 

12k2  — e*(i) 


since  (*.-*.  ,)  * “7“^ — \7  • Thus  alnlaizlng  the  error  in 
i i-l 

approxlokitlng  (2.6.4)  and  selecting  the  density  g*  that  ainimlzes 
the  right  hand  side  of  (2.6.5)  arc  (.-isynptotic.nlly)  equivalent 
probleaa.  Chemoff  solves  this  by  variational  aethods  to  obtain 

(2.6.6) 


• 1/ 

where  C • / 4 ^(s)dz.  As  k -»  «•  the  z^  correspond  to  approxiaately 

the  ^ Y quAntile  of  the  g(z)  distribution.  Thus  Chemoff's  solution 
is  Co  take  the  s^  such  cbt-iC  they  satisfy 

/ ♦^’(*)d«  - / ♦''’(«)<!»  . (2.6.7) 

The  error  In  approxlast ing  (2.6.4)  is  approximately 

— Ij  6'^(i)6»1’  . (2.6.8) 

k2l2  — 


Equation  (2.6.7)  can  be  seen  to  be  the  saae  solution  as  sug- 
gested in  Theorea  2.5.1  by  making  the  change  of  variable  z • 6 \u) . 
Thla  eaae  procedure  shows  that  the  problea  of  finding  s density 
which  alniaises  the  error  tcra  (2.6.5)  is  identical  to  the  problea 
of  finding  a density  that  attains  the  bound  (2.1.13)  (here  f^^  * S. 
Thus  not  only  the  solutions  but  the  problea*  themselves  are  the  same. 

SXmdal  (1962)  treats  the  problea  of  aelecting  optimal  spacing* 
as  one  of  selecting  an  optimal  generating  function.  He  defines  a 
generating  function  to  be  a non-negative  density  function  defined 
on  an  interval  (a,b]  of  the  real  line.  Let  C be  the  c.d.f.  that 
corresponds  to  a density  function  g.  Denote  by  C ^ the  inverse 
function  of  G.  A set  oi  k spacings,  (u^,  u^*  .(>*  u^}«  taken 
according  to  g satisfy 


where  F la  the  c.d.f.  in  (2.4.1). 
o *■  , 
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Under  the  assunpeion  that  g and  Its  first  derivative  arc 
bounded  and  continuous  with  g * 0 » the  author  shows  that  the  loss 
In  efficiency  In  estimation  usinc  linear  estinates  based  on  spacincs 
chosen  according  to  g is 


. • J*(x)f  (x)  . 

L / — + 0(k  ) (2.6.10) 

12(k  + 1)2  -«  (g(x)l 


where 

d^logf  (x) 

J*(x)  ■ • v.(x) 

dx^  * 


dlogf  (x) j 


-iRiW 

• ('».(»).  w,(x))A 


for  0 known  . 


for  \$  known  , (2.6.11) 


for  both  u and  o known  . 


He  then  defines  nearly  optimal  spaclngs  to  be  those  obtained 
according  to  a generating  function  that  minimizes  (2.6.10).  A 
calculus  of  variations  argument  shows  the  optimal  function  to  bs 

**(x)  - (J*(x)f^(x)l'^3  (2.6.12) 

where 

C-/  IJ*(x)t_,(x))‘'j6x 


Again  by  letting  x - Q^(n)  and  substituting  the  functions 

f^Q  (u)  and  f Q (u)Q  (u)  for  those  in  Rerurks  2.3.2  and  2.3.3. 
o o o o 0 ' 

the  problems  considered  by  Sarndal  and  subsequent  solutions  are 
teen  to  be  equivalent  to  those  In  Section  2.5. 


(2.6.13) 


40 


39 


3.  APPLICATIONS 

3.1  Prclialnarles 


0 


If  y ftnd  o unknown  , 


The  regression  khIoI  for  locsclon  onJ  scslc  parameter  cstlKiclon 
has  been  seen  to  be 

f q (u)q(u)  - uf^q^Cu)  + * OjB(u)  O.i.i) 


o/<^ 


(3.x. 2) 


and  (B(u),  u c fO.lli  Is  a Brownian  Bridge  process.  From  this  model* 
it  can  be  deduced  that  selecting  optimal  spacings  is  equivalent  to 
selecting  optimal  regression  designs. 

In  Chapter  2,  design  sequences  that  were  asymptotically  optimal 
were  considered.  Such  sequences  for  the  Brownian  Bridge  process 
were  generated  by  density  functions  on  10,1).  For  the  model  (3.1.I)i 
the  optimal  densities  are  given  by  « 

K q.(u)")^3 


h*(u)  - 


f'lf  q (t)"l’'’<lt 
0 ° “ 


(iq  (u)f  q (u)) 

o o o 


.,^3 


/'(lO  (t)I  q 

0 ® ® ® 


if  o is  known 


if  y is  known 


(3.1.3) 


where 

♦•(u)  - -(If  q (U)l",  [q  {u)f  q (u))")  (3.1. «> 

0 O 0 0 0 

and  A la  the  information  matrix  defined  in  Sectloo  2.5. 

In  order  to  determine  the. asymptotically  optimal  set  of  spacings 
for  a given  number*  k*  of  order  atatistlca** it  ia  flrat  necessary 
to  compute  the  optlaml  density*  h**  and  its  corresponding  c.d.f.  H*. 

Then  the  required  spacings  are  the  points  H*  * [v~* ^ 

To  utilize  the  preceding  theory  for  data  analysis*  the  researcher 
would  require  the  H*  or  H*  functions  for  many  of  the  cosmon 
distributional  forms.  Such  functions  are  derived  in  Section  3.2. 
Comparison  of  the  spacing^  obtained  using  (3.1.3)  with  those 
obtained  by  other  authors  are  .ilao  provided. 

Once  a epaclngs  set  has  been  decided  apon,  the  estimators  can 
then  be  constructed.  The  ABLUE's  for  a specified  epaclngs  set* 
u^*  U2 » are  given  as  follows: 

1.  When  0 is  known*  the  estimator  fdr  y is 

k 

y - r^Wy(i)Q(u^)  - oC^  , (3.1.5) 

2.  When  y Is  known,  the  eatlmetor  for  0 ia 

0 • I w (l)Q(u,)  - yC^  . (3.1.6) 

1-1  ® " 
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and 


SlBultancous  Gsllm.itors  for  u and  a arc 

p - r S (l)Q(u  ) 


0 


k 

£ S^(l)Q<u  ) . 

1«1  ” 


(3.1.7) 


(3.1.8) 


The  exact  foraulas  for  these  estinatora  and  their  efficiencies  nay 

be  obtained  by  referring  to  Section  2.5.  In  subsequent  sections, 

the  coefficients  W , W . S . S , and  the  correction  factors  C and 
W O*  u 0 P 

will  be  presented  for  certain  spaclngs  sets  of  interest.  It 
should  be  noted  that  the  correction  factors  are  zero  in  the  case 
of  syrranetric  spaclngs  for  a syni&etrlc  distribution. 

In  Section  3.3,  the  problem  of  selecting  order  statistics  for 
sucnarizlng  large  data  sets  will  be  considered.  It  will  be  seen 
that  the  selection  of  a few  strategically  placed  order  statistics 
will  provide  sufficient  information  to  construct  efficient  location 
and  scale  paratncicr  cstinutors  under  a variety  of  distributional 
assumptions. 

t 

3.2  Spaclngs  for  Some  Coognon  Distributions 


P (x)  - tCx)  . / ♦(t)dt  . 


(3.2.2) 


qju)  - ♦‘‘(u) 


(3.2.3) 


f Q (u)  - ♦r‘{u)  - (2.)‘'^2,„P  (-|♦■'(u)P/2)  . (3.2.4) 


The  matrix  A Is  found  to  be 

A - dlag(1.2) 


(3.2.5) 


When  0 is  known  spaclngs  taken  according  to  the  density 


h*(u)  ■ • 


/'(♦*■' (t)r^’dt 
0 


(3.2.6) 


will  be  asyBDtotlcally  optimal  for  estimating  the  mean.  Since  h* 
is  sysnctric,  the  spaclngs,  u^,  will  satisfy 


Vw  - 1 • o.i.i) 


3.2.1.  Normal  Distribution 


Using  (3.2.6)  it  can  be  shown  that 


The  c.d.f.  and  p.d.f.  of  the  standerd  normal  distribution  are 
usually  denoted  by  6 and  4 respectively.  In  this  notation,  the 
functions  to  be  considered  are: 

f (x)  - ♦(x)  - (2.)*’^exp  (-x2/2)  , -.  < X < - , (3.2.1) 

o 


H.‘'(u)  . .(/J  .'‘(u))  . (3.2.8) 

The  function  in  (3.2.8)  Is  shown  graphically  in  Figure  I and 
is  tabulated  in  Table  1 for  points  in  the  Interval  I0,.53.  To  find 
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Plgur«  1.  Normal  Distribution,  o Known; 
The  Function  H*  * (u) 


Table  1.  Norsisl  Distribution,  o Known; 
The  Function  H*  ^ (u) 


u 

u 

.01 

.00003 

.26 

.13136 

.02 

.00019 

.27 

.14457 

.03 

.00056 

.28 

.15665 

.04 

.00122 

.29 

.16853 

.05 

.00226 

.3 

.16141 

.06 

.00366 

,31 

.19489 

.07 

.00539 

.32 

.20897 

.08 

.00776 

.33 

.22363 

.09 

.01017 

.34 

.23885 

.1 

.01321 

.35 

.25143 

.11 

.01659 

.36 

.26763 

.12 

.02068 

.37 

.26434 

.13 

.025 

.38 

.29806 

.14 

.03074 

.39 

.31561 

.15 

.03593 

.4 

.32997 

.16 

.04272 

.41 

.34458 

.17 

.04947 

.42 

.36317 

.16 

.05705 

.43 

.3707 

.19 

.06426 

.44 

.39743 

.2 

.07353 

.45 

.41293 

.21 

.08226 

.46 

.4325 

.22 

.09176 

.47 

.44828 

.23 

.10027 

.48 

.46414 

.24 

.11123 

.49 

.48404 

.25 

.121 

.5 

.5 

45 


K*  ^ values  for  u > .5  the  relation  {5.2.7)  may  he  used.  Table  2 
contains  spacings  c.nlculaLcd  using  (3.2.8)  .*101!  their  corresponding 
coefficients  for  k ■ 7,9.  The  asymptotically  optiswil  spacings  for 
k • 2,7,9  arc  compared  to  the  optimal  setr  obtained  by  Ogawa  (1951) 
in  Table  5. 

To  estimate  o when  p ie  known,  the  optimal  density  is 

h*(u) . (3.2.9) 

0 

A tabulation  of  the  function  H*  \u)  is  given  In  Table  4 for  u In 
the  interval  [0,.5].  For  other  values  of  u the  relation  (3.2.7)  may 
be  used.  A graph  of  H*  ^ is  shown  in  Figure  2.  Table  5 contains 
the  asymptotically  optlnwil  spacings  and  corresponding  coefficients 
and  efficiencies  fork  « 7,9.  Table  6 provides  a comparison  of 
the  asymptotically  optimal  spacings  with  the  optimal  spacing  given 
by  Ogawa  (1951)  for  k ■ 2,6. 

For  simultaneous  cstlmaclon  of  p and  o, spacings  should  be 
taken  according  to  the  density  ( 

(1  + 2l#”\u)l^)  ^^cxp{ ’/jlc*  '(u)]^) 
h*(u)  - . (3.2.10) 

/\l  + (V3U‘*(t)|Mdt 

0 

Since  the  spacings  goncr.ntcd  by  h*  are  symmetric,  the  function  H*  * 
has  been  tabulated  only  over  the  Interval  (0,.S].  This  tabulation 
is  given  in  Table  7.  A graph  of  H*  ^appears  in  Figure  3.  Table  6 


Table  2.  Normal  Distribution,  0 Elnown;  Asymptotically 
Optimal  Spneings  and  Coefficients  for  Seven 
or  Nine  Order  Statistics 


1 

k - 

7 

k 

- 9 

“1 

W^(l) 

“1 

W^(l) 

1 

.023 

.049 

.013 

.028 

2 

.121 

.138 

.074 

.087 

3 

.29 

.201 

.184 

.134 

4 

.5 

.223 

.334 

.164 

5 

.71 

.201 

.5 

.3-3 

6 

.879 

.138 

* 

.666 

.164 

7 

.977 

.049 

.816 

.134 

8 

.926 

.087 

9 

.987 

.028 

T«ble  3.  Normal  Distribution*  a Known;  A Comparison  of  Optimal  and  Asymptotically 
Optimal  Spaclngs  and  Their  Corresponding  Efficiencies 
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Table  4.  Normal  Dletrlbutlen*  y Know; 


C4 

The  PuactloD  U*" 

'(u) 

•e 

r>» 

r*> 

O 

O 

fS 

<o 

r>» 

o» 

Ot 

9> 

& 

u 

**■' (u) 

u 

«*■' (u) 

• l 

•H 

M' 

S ^ 

.01 

.0 

.26 

.04176 

cS 

O ^ 
w u 

O 

O 

M 

m 

m 

«e 

o> 

Q* 

Ot 

.00002 

.27 

.04712 

r 

.03 

.00006 

.28 

.05296 

.04 

.00013 

.29 

.05931 

.05 

.00026 

.3 

.0662 

1 

r«. 

OD 

r4 

«s 

«« 

tn 

.06 

.00045 

.31 

.07367 

•«» 

O 

«e 

o> 

>o 

• 

o> 

<o 

9> 

.07 

.00072 

r32 

.08176 

s 

.08 

.00106 

.33 

.09051 

r* 

.09 

.00156 

.34 

.09996 

■ 

M 

.1 

.00215 

.35 

.11017 

55 

w 

Ok 

.11 

.00288 

.36 

.1212 

O 'H 

W U 

1 o 

s 

r* 

«0 

e« 

«« 

.12 

.00376 

.37 

.1331 

i'5’ 

>» 

.13 

.00481 

.38 

.14596 

•< 

.14 

.00604 

.39 

.15987 

1 

1 

.15 

.00747 

.4 

.17492 

1 

n 

r>. 

0> 

.16 

.00912 

.41 

.19124 

w 

IM 

.17 

.01101 

.42 

.209 

, 

.18 

.01314 

.43 

.2284 

>« 

.19 

.01556 

.44 

.2497 

X 

e 

lA 

.2 

.01826 

.45 

.27328 

23 

0 -r* 

227 

773 

o 

o 

eo 

.21 

.02127 

.46 

.29968 

a fh 

1 

.22 

.02461 

.47 

.32982 

3 

.23 

.02831 

.48 

.36537 

.24 

.03239 

.49 

.41045 

M 

u 

.25 

.03686 

.5 

.5 

• 

m 

• 

0. 

9 

9 

9 

9 

9 

9 

9 

9 

9 

•M 
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Flgute  2.  Hormnl  Distributlonp  p Known; 
The  Function  H*”^(u) 


Teblh  5.  Noraal  Distributlonp  p Rnovn;  Asyaptotlcally 

Optimal  SpaclniiS,  Coefficients  and  Efficiencies 
for  Seven  or  Nine  Order  Statistics 


1 

k - 7 

k • 9 

“i 

1 

.004 

O 

1* 

.002 

-.015 

2 

.037 

-.111 

.018 

-.062 

3 

.14 

-.2 

.066 

-.115 

4 

.5 

.0 

.175 

-.164 

5 

.86 

.2 

. .5 

.0 

6 

.963 

.111 

.825 

.164 

7 

.996 

.031 

.934 

.115 

8 

.982 

.062 

.998 


9 

Efficiency 


8848 


9231 


015 
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Table  6.  Normal  Distribution,  Known;  A Comparison 

of  Optiautl  end  AsympLoticully  Optimal  Spacin&s 
and  Their  Corresponding  Efficiencies 


Table  7.  Normal  Distribution,  Loth  p and  o Unknown; 
The  FunctlM 


k - 2 

k - 6 

u 

H*  ' (u) 

u 

H*‘‘ (u) 

Asymptotically 

Asymptotically 

.01 

.0 

.26 

.06164 

Spacing 

Optimal  Optimal 

Optimal 

Optimal 

.02 

.00003 

.27 

.06938 

.006 

.01 

.03 

.00009 

.28 

.07779 

.091  .069 

.04 

.0002 

.29 

.08688 

“2 

.909  .931 

.056 

.055 

.05 

.0004 

.3 

.09672 

“3 

.228 

.17 

.06 

.00069 

.31 

.10731 

.772 

.83 

.07 

.0011 

.32 

.11872 

.08 

.00166 

.33 

.13097 

“5 

.946 

.945 

.09 

.00237 

.34 

.14411 

“6  . 

.994 

.99 

.1 

.00326 

.35 

.15819 

.U 

.00436 

.36 

.17323 

Efficiency 

.6A1S  .6322 

.8761 

.8943 

.12 

.00569 

.37 

.1893 

.13 

.00726 

.38 

. 20644 

.14 

.00911 

.39 

.22467 

.15 

.01125 

.4 

.24407 

.16 

.01372 

.41 

.26465 

.17 

.01652 

.42 

.28644 

< 

.18 

,0197 

.43 

.30947 

.19 

.02327 

.44 

.33372 

.2 

.02727 

.45 

.35917 

.21 

.03171 

.46 

.38574 

.22 

.03663 

.47 

.41331 

.23 

.04205 

.48 

.44171 

.24 

.048 

.49 

.4707 

.25 

.05453 

.5 

.5 

i 
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Figure  3.  Normal  Distribution,  M and  o Unknown; 
Hie  Function  U*  ^(u) 
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contains  asymptotically  optimal  spsclngs  and  the  corresponding 
coefficients  and  efficiencies  for  estimators  based  on  seven  or 
nine  order  statistics. 

In  the  esse  that  k ■ 2,  Ogawa  (1951)  haa  shown  chat  the  spacinga 


almultancoua  estimation  of  m and  o.  Ths  apacings  obtained  using 
(3.2.10)  arc  * .132  and  U2*  ■ .868.  The  efficiency  for  this 
latter  apacings  set  is  .4065. 


3.2.2  Exponential  Distribution 


The  exponential  c.d.f.  is 


F^(x)  • 1 - exp  (-x)  . X > 0 . 


The  functions  required  in  construction  of  the  optimal  density  arc: 


f (x)  - exp  (-x) 


Q^(u)  - -logd  - u) 


000  000 


Table  8.  Noittal  Diutrlbulion,  noth  U n»d  O Uiiknowu; 

Aayaptotically  Optimal  Spacinge,  Coof flclentt, 
and  Efficiencies  for  Seven  or  Nine  Order 
StatUcica 


1 

k - 7 

k - 9 

“i 

s/l) 

8,(1) 

“t 

s^d) 

S (1) 

0 

1 

.006 

.018 

-.025 

.003 

.009 

-.012 

2 

OOS 

.093 

-.082 

.027 

.044 

-.045 

3 

.197 

.24 

-.106 

.096 

.109 

-.075 

4 

.5 

.357 

.0 

.244 

.214 

-.075 

5 

.802 

.24 

.106 

.5 

.286 

.0 

6 

.945 

.093 

.082 

.756 

.214 

.075 

7 

.994 

.018 

.025 

.903 

.109 

.075 

8 

.973 

.044 

.045 

9 

.997 

.009 

.012 

Efficiency 

.8382 

.8903 

Aa  was  seen  in  Section  2.5,  tetiKitioo  of  \t  wlten  o la  known 
or  simultoneoua  estimation  of  both  u and  o could  not  be  acconpllshed 
using  the  design  techniques  of  Chapter  2.  Consequently,  M»ly  the 
estimation  of  o when  y Is  known  will  be  considered. 

When  the  location  parameter  Is  known,  the  optimal  density  of 
scale  parameter  etclmstlon  Is 


h*(u) 


(u  - 

/'(t  - 
0 


H*"  (u)  - 1 - (1  - u)*  . 


(3.2. U) 


(3.2.17) 


The  function  (3.2.17)  is  shown  graphically  In  Figure  4. 

The  optimal  spacings  for  estimating  o when  p Is  known  have 
been  found  by  Ssrhan  end  Greenberg  (1958) . A cosq>arlsoo  of  the  per* 
formance  of  these  optimal  spacings  with  that  of  the  spacings  obtained' 
using  H*  ^ Is  provided  by  Table  9 In  the  case  of  k • 2,7,9.  'Rie 
asymptotically  optimal  spacings,  correction  factors  and  coefficients 
ere  given  In  Table  10  for  estimators  based  on  either  seven  or  nine 
order  statistics. 

3.2.3  Pareto  Distribution 

Hie  distribution  function  for  the  Pareto  distribution  Is 


TU)  - 1 - (1  + »)■'’  , » > 0 
P. 


(3.2.18) 


Table  10.  Exponential  Dlstrlbutlont  M Known;  Asymptotically 
Optimal  Spaclncs,  Coefficients  .ind  Correction 
Factora  for  Seven  or  Nine  Order  Statlscica 


1 

k 

- 7 

k - 

9 

w,(l) 

“i 

1 

.33 

.297 

.271 

.248 

2 

.578 

.219 

.488 

.196 

3 

.756 

.152 

.657 

.15 

4 

.875 

.129 

.784 

.11 

S 

.974 

.033 

.875 

.076 

6 

.984 

.016 

.936 

.049 

7 

.998 

.005 

.973 

.027 

a 

.992 

,012 

9 ' 

.999 

.001 

c 

.849 

.869 

0 

where  v > 0 la  a known  ahape  parameter.  The  required  functions  are: 


f (»)  - v(l  + , 

o * 

(3.2.W) 

Q^(u)  - a - u)"''''- 1 . 

(3.2.20) 

tQM  - vU  - u)‘*  ''''  . 

0 0 

(3.2.21) 

Section  2.2  for  this  notation).  Thus  as  in  the  case  of  the  exponential 
dlatributlon»  optimum  spacings  for  estimating  u when  o la  known  or 
for  simultaneous  parameter  estimation  cannot  be  obtained  using  Che 
theory  of  Chapter  2.  However,  the  Q *f  Q function  has  the  desired 

O 0 0 

properties  so  the  estimation  of  a when  v is  known  using  asymptotically 
optimal  apaclnga  can  still  be  accomplished. 

The  optimal  density  for  scale  parameter  estimation  in  the  case 
that  V Is  known  Is 


2«2v 

j\l  - t)  dt 
0 


H*  (u)  - I - (1  - u)‘ 


: function  la  shown  graphically  in  Figure  S for  v • .5,lt2,3. 
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Figure  S.  Farcto  Distribution,  p Known,  v " 
The  Function  11*  ^(u) 


In  the  ease  that  v ■ 1,  the  epocingn  obtained  iron  (3.2.23) 
are  the  pointa  ^ , i « 1,  k.  This  aolution  agrees  with 

the  optimal  spaclnga  given  by  Kulldorf  and  Vannnan  (1973).  An 
explicit  expression  for  the  ABUIE  is 


6 

k(k  + l)(k  + 2) 


k 

r (k  - i + 1)2 
1-1 


2k  ■»  1 
k 2 


V. 


(3.2.24) 

For  V fi  1 the  spaclnga  generated  by  h*  are  only  asymptotically 
optisMl.  A comparison  of  the  optimal  spacfhgs  with  those  generated 
by  (3.2.22)  is  presented  in  Tables  11*13  for  the  cases  v • .5,2,3 
respectively.  Tables  14‘>16  contain  the  asymptotically  optimal 
spacing  and  corresponding  coefficients  and  correction  factors  for 
estimators  based  on  either  seven  or  nine  order  statistics  for  when 
V • .5,2,3  respectively. 


3.2.4  Cauchy  Distribution 


For  the  Cauchy  distribution,  the  required  functions  are: 


j tan'*  (x)  , -^i  < X < - , 

(3.2. 2S) 

...  1 1 
f (x)  

(3.2.26) 

® ’ 1 + x2 

Q (u)  - t.n[i(u  - -J-)] 

0 c 

(3.2.27) 

f Q„(«)  - ¥“'aln^(Ttu) 
0 o 


(3.2.28) 


Table  11.  Pareco  Distrlbutloni  v • .5,  u Known;  A Comparison  of  Opclmnl  and 

Asymptotically  Optimal  Spaclnga  and  Their  Corresponding  Efflclenelea 


r 


Efficiency  .8701  .8711  .9808  .9809  .9877  .9877 


Table  13.  Pareto  Distribution,  v • 3,  u Knovn;  A Conparison  of  Optimal  and 

Asymptotically  Optimal  Spaclnga  and  Their  Corresponding  Efficiencies 
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Tabic  15.  Tarcto  WscrJbutlon,  v - 2,  M I’nowii;  Asyaiplol Jcally 

Optimal  Spaclngs,  Coefficients  and  Correction  Kaclora 
for  Seven  or  Nine  Order  Statistics 


Table  16.  ?areto  Distribution,  v - 3.  p Knovn;  Asymptotically 
Optimal  Spacings.  Coefficients  and  Correction 
Factora  for  Seven  or  Nine  Order  Statistics 


1 

k 

- 9 

1 

k - 

7 

k 

- 9 

W_,(l) 

w,(l) 

“i 

W^(l) 

1 

.182 

.876 

.146 

.737 

1 

.214 

1.171 

.173 

.985 

2 

.35 

.645 

.284 

.583 

2 

.404 

.861 

.331 

.769 

3 

.506 

.447 

.414 

.447 

3 

.571 

,598 

.471 

.598 

6 

.646 

.286 

.535 

.328 

4 

.713 

.382 

.601 

.441 

s 

.77 

,161 

.646 

.228 

5 

.829 

.215 

.712 

.305 

6 

.875 

.071 

.747 

.146 

6 

.918 

.094 

.808 

.194 

7 

.956 

.017 

.836 

.082 

7 

.976 

.022 

.885 

.11 

8 

.911 

.036 

6 

.945 

.048 

9 ' 

.968 

.009 

9 

.984 

.011 

C 

2.503 

2.595 

2.343 

3.461 

0 

G9 
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For  the  computotlon  of  efficiencies  It  should  be  noted  that 


1 

2 * 


(3.2.29) 


Table  17.  Cauchy  Distribution,  o Known; 
The  Function  H*(u) 


When  the  scale  paranoter  Is  assumed  known,  spoclngs  taken 
according  to  the  density 

[2icos(?vu)  j 

h*(u) (3.2.30) 

/ [2»cos(2st)  J 
0 

will  be  asymptotically  optimal  foe  estimating  the  location  parameter. 
A tabulation  of  the  optimal  c.d.f.,  H*,  is  given  in  T.ible  17  for 
the  interval  (0,.5].  Values  for  )1*  ^ may  be  calculated  from  the 
table  by ^interpolation  in  combination  with  the  use  of  (3.2.7).  A 
graph  of  H*  ^ is  shown  in  Figure  6. 

Bloch  (1906)  has  considered  the  estimation  of  u by  a linear 
function  of  five  order  statistics.  The  optimal  spaclngs  set  in  this 
case  was  found  to  be  (.13,  .4,  .5,  .6,  .87)  yielding  an  asymptotic 
relative  efficiency  of  .9516.  Interpolation  in  Table  17  results  in 
the  spaclngs  cct  (.125,  .372,  .5,  .628',  .871).  This  latter  set  has 
.9481  as  its  efficiency.  Asymptotically  optimal  spaclngs  and  their 
corresponding  coefficients  and  efficiencies  arc  given  in  Table  18  for 
either  seven  or  nine  order  statUtlcs. 


3.2.5  Logistic  Distribution 


A frequent  parameterixatlon  for  the  logistic  distribution  is 

1-1 


F(x)  - 


.«  < X < • 


(3.2.31) 


u 

H*(u) 

u 

H*(u) 

.01 

.01402 

.26 

.25133 

.02 

.028 

.27 

.25422 

.03 

.04191 

.28 

.25829 

.04 

.0557 

.29 

.26336 

.05 

.06935  , 

.3 

.26935 

.06 

.08281 

.31 

.27616 

.07 

.09604 

.32 

.28373 

.08 

.10902 

.33 

.29202 

.09 

.12171 

.34 

.30095 

.1 

.13406 

.35 

.3105 

.11 

.14605 

.36 

.32061 

.12 

.15762 

.37 

.33125 

.13 

.16875 

.38 

.34238 

.14 

.17939 

.39 

.35395 

.15 

.1895 

.4 

.36594 

.16 

.19905 

.41 

.37829 

.17 

.20798 

.42 

.39098 

.18 

.21627 

.43 

.40395 

.19 

.22384 

.44 

.41719 

.2 

.23065 

.45 

.43065 

.21 

.23664 

.46 

.4443 

.22 

.24171 

.47 

.45809 

.23 

.24578 

.48 

.472 

.24 

.24867 

.49 

.48598 

.25 

.25 

.5 

^5 

Figure  6.  Caucliy  Distribution,  o Known; 
The  Function  H*  (u) 


Table  18.  Cauchy  Distribution,  o Known;  Asymptotically 

Optimal  Spacings,  Coefficients,  and  Efficiencies 


for  Seven 

or  Nine  Order 

Statistics 

1 

k - 

7 

k « 

9 

w^(0 

“i 

1 

.093 

-.031 

.073 

-.014 

2 

.25 

.0 

.161 

-.048 

3 

.407 

.343 

.339 

.104 

4 

.5 

.376 

.427 

.26 

5 

.593 

.343 

.5 

.397 

6 

.75 

.0 

.573 

.26 

7 

.907 

-.031 

.661 

.104 

.639  -.048 
.927  -.014 
.9743 


9 

Efficiency 


9579 
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where  u and  o ore  respectively  the  neon  and  siamtard  dcvlodon  of  the 
distribution.  The  Introduction  of  the  factor  «//3  requires  a slight 
ttodlflcstlon  of  the  vodcl  (3.1.1).  In  this  case  the  ttodel  bccoses 


f^(u)Q(u)  * ufj(u)  + of2(u)  + OgBfu)  (3.2.32) 


f,(u)  - -L  u(l  - u)  , 

1 n 


tjCu)  • u(l  - u)lo»  J - 


0^  • o//n 


and  (B(u),  u c 10,1])  la  a Brownian  Bridge  proceaa.  The  work  at  Gupta 
and  Cnanadeslkan  (1966)  may  be  used  to  deduce  that  for  the  model 


3 + It* 

A - dlag  Y » — 9 — • 


To  estimate  p when  o 1*  known,  spaclngs  should  be  taken  according 
to  the  uniform  distribution  on  [0.1] » l.e.g 


H*’\u)  - u . 


For  a given  value  of  k.  the  spacings  are  the  points  ^ ^ , 1 • 1, 
k.  which  was  seen  to  be  the  optimal  solution  In  Section  1.2.6.* 


*Thls  Is  the  same  as  the  solution  obtained  for  the  Pareto  dlstri** 

butlon  with  Y - 1.  -Thus  9 plot  of  H*“*  Is  given  In  figure  5 (p.  61). 


Reference  may  be  made  to  that  section  for  an  explicit  form  for  the 
estimator  and  its  efficiency. 

VIhcD  p is  known,  the  density  function 


generates  spacings  that  are  asymptotically  optimal  for  estimating  0. 

A tabulation  of  the  corresponding  e.d.f..  H*.  Is  presented  in  Table  19. 
Values  for  H*  ^ may  be  obtained  by  Interpolation  In  this  table  for 
points  in  [0..5]  or  by  the  use  of  (3.2.7)  for  points  outside  this 
interval.  A graph  of  the  function  Is  shown  In  Figure  7. 

Cupta  and  Cnanadeslkan  have  found  optimal  spacings  (or  cither 
two  or  three  syssnctrlc  quantiles  for  the  esclmation  of  0 when  p Is 
assumed  known.  A comparison  of  their  spacings  with  the  ones  generated 
by  (3.2.36)  is  presented  in  Table  20.  Table  21  gives  asymptotically 
optimal  spacings  and  coefficients  for  k ■ 7.9. 

For  simultaneous  estimation  of  p and  o the  optimal  density  Is 


A tabulation  of  the  corresponding  distribution  function.  K*.  Is  given 
In  Table  22.  Spacings  may  be  obtained  from  this  table  through 


T«blc  19. 


Logistic  Distribution,  v Known; 
The  Function  H*(u) 


Figure  7.  Logistic  Distribution,  m Known 
The  Function  B**  (u) 


u 

u 

H*(u) 

.01 

.12276 

.26 

.42392 

.02 

.15956 

.27 

.42946 

.03 

.18612 

.28 

.43478 

.04 

.20774 

.29 

.4399 

.05 

.22632 

.3 

.4448 

.06 

.2428 

.31 

.44951 

.07 

.25771 

.32 

.45401 

.08 

.27139 

.33 

.45832 

.09 

.28408 

.34 

.46244 

.1 

.29593 

.35 

.46636 

.11 

.30706 

.36 

.47009 

.12 

.31758 

.37 

.47363 

.13 

.32755 

.38 

.47697 

.14 

.33703 

.39 

.48012 

.15 

. 34606 

.4 

.48307 

.16 

.35469 

.41 

.48582 

.17 

.36295 

.42 

.48836 

.18 

.37086 

' .43 

.4907 

.19 

.37845 

.44 

.49281 

.20 

.38574 

.45 

.4947 

.21 

.39274 

.46 

.49635 

.22 

.39947 

.47 

.49774 

.23 

.40595 

.48 

.49885 

.24 

.41217 

.49 

.49964 

.25 

.41816 

.5 

.5 

Tabic  20.  Lcr.J.,tlc  Distribution.  |i  Known;  A Comiwrlson 
of  Optimal  and  Asymptotically  Optimal  Spacines 
and  Tbolr  Corrcspondlnj  Kfflcienclcs 


Table  21.  Logistic  Distribution,  u Known;  Asymptotically 
Optimal  Spaclngs.  Coefficients  and  efficiencies 
for  Seven  or  Nine  Order  Statistics 


^ k - 3 ^ k - » 


Spacing 

Asymptotically 

OptlB.nl 

OptlBal 

Asymptotically 

Optimal 

OptlBsl 

1 

“i 

w,(l) 

"i 

w,(l) 

“i 

.134 

sl03 

.065 

.103 

1 

.011 

-.035 

.008 

-.023 

2 

.065 

-.136 

.036 

-.066 

“2 

sdM 

.897 

.5 

.5 

3 

.185 

-.26 

.104 

-.142 

“3 

.935 

.897 

4 

.5 

.0 

.221 

-.214 

Efficiency 

.6686 

.6838 

.6494 

.6838 

5 

.815 

.26 

.5 

.0 

6 

.935 

.136 

• .779 

.214 

— 

7 

.989 

.035 

.896 

.142 

8 

.964 

.066 

9 

.992 

.023 

Efficiency 

.9016 

.9364 
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Table  22.  Logistic  Distribution,  Doth  m ami  o Unknown; 
The  Function  H*(u) 


u 

H*(u) 

u 

H*(u) 

.01 

.10529 

.26 

.37907 

.02 

.1373 

.27 

.38499 

.03 

.16045 

.28 

.3908 

.04 

.17933 

.29 

.39649 

.05 

.1956 

.3 

.40207 

.06 

.21008 

.31 

.40755 

.07 

.22323 

.32 

.41294 

.08 

.23535 

.33 

.41825 

.09 

.24663 

.34 

.42347 

.1 

.25722 

.35 

.42861 

-11 

.26724 

.36 

.43368 

.12 

.27675 

.37 

.43869 

.13 

.28582 

.38 

.44364 

.14 

.2945 

.39 

.44853 

.15 

.30285 

.4 

.45337 

.16 

. 31088 

.41 

.45816 

.17 

.31863 

.42 

.46291 

.18 

.32613 

.43 

.46763 

.19 

.3334 

' .44 

.47231 

.2 

.34045 

.45 

.47697 

.21 

.34731 

.46 

.4816 

.22 

.35398 

.47 

.48622 

.23 

. 36048 

.48 

.49082 

.24 

.36682 

.49 

.49541 

.25 

.37301 

.5 

.5 

interpolation  and  the  uac  of  (3.2.7).  The  graph  of  It**'  1.  .houn 
In  Figure  8. 

Heseancln  (1969b)  has  obtained  epeelngs  for  alaultaneoue 
estlwtion  of  p end  o that  •iniaise  the  sun  of  the  variances  of  the 
estiMtors.  A coaperlson  of  these  suboptisMl  spaclngs  with  spaclngs 
generated  by  (3.2.39)  is  provided  by  Table  23  for  k - 2,7,9.  Table 
24  contains  the  aay^>totleally  optUal  epaclngs,  coefficients  and 
efficiencies  for  slnultaneous  estlnatlon  using  cither  seven  or  nine 
order  ststlatlcs. 

3.2.6  Velbull  Distribution 

The  Welbull  c.d.f.  U 

F^(x)  - 1 - e , * > 0 , (3.2.40) 

where  y > 0 is  a known  shape  psroneter.  Tie  other  functions  that 
will  bs  needed  are: 

f^(x)  • , (3.2.41) 

0 (u)  • 110*7^  1*^''  . (3.2.42) 

SC'* 

f,Q,(u)  - T(1  - '>)H08(7T7]i‘'‘^''  . (3.2.43) 

The  work  of  Harter  and  Moore  (1967)  nay  be  used  to  deduce  that 

..  • (3.2.44) 


Efficiency  .4422  .4142  .8651  .8605  .9095  .9051 
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Tabic  24.  Lor.tffdc  Distribution,  p .ind  o Pnknown;  Asyniitotlc.nlly 
Dptlnol  Sp.nclnfs  and  CooiliclenlM  for  Seven  or  Nine 
Order  Stctistlcc 


For  cstiKiting  o when  m !•  lu\own,  spaclngs  should  be  taken 
according  to  the  density 


k - 7 

k - 9 

- ■ ■ 

b*(u)  - i (1  - u)^>  . 0.2.«) 

1 

"i 

S^(l) 

S,(l) 

0 

S.<‘> 

S,(l> 

- — 

Thus 

1 

.016 

.004 

-.042 

.001 

.001 

-.022 

2 

.093 

.039 

-.143 

.033 

,02 

-.077 

S***(u)  - 1 - (1  - «)*  . (3.J.46) 

3 

.233 

.226 

-.167 

.147 

.088 

-.137 

4 

.3 

.366 

.0 

.296 

.212 

-.131 

As  this  la  the  same  solution  that  was  given  for  the  exponential 

5 

.747 

.228 

.167 

.3 

.293 

.0 

«1  • 

6 

.907 

.039 

.143 

.704 

.212 

.131 

distribution,  the  properties  of  H*  (specifically  its  gr?ph)  can  be 

7 

.964 

.004 

.042 

.853 

.088 

.137 

found  in  Section  3.2.2.  Asymptotically  optimal  specings,  coefficients 

g 

.947 

.02 

.077 

correction  factors,  and  efficiencies  for  seven  or  nine  order 

9 

.99 

.001 

.022 

fficiency 

.6631 

♦ 

.9095 

statistics  are  given  in  Tables  25-27  for  y • %»2,4  respectively. 

3.2.7  Extreme  Value  Dletrlbutlon 


Hie  c.d.f.  of  the  extreme  value  distribution  Is 


F <x)  ■ exp  {-  exp  (-x))  , -•<*<•,  (3.2.47) 

o 

with  correepondinft  p.d.f. 


f (x)  ■ exp  (-(x  ♦ exp  {-x)))  , (3.2.48) 

o 

The  ocher  functions  required  for  optimal  density  construction  are: 


and 


Q^(u)  - -log  log[“] 


(3.2.49) 
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Table  25.  Welbull  DiRtrlbution*  y • %,  u Known; 

Asymptotically  Opclaial  Spacln^K,  Cocf flcicnCS* 
Correction  Factors,  nnd  F.fricicncics  for 
Seven  or  Nine  Order  Scatisclcs 


k - 

7 

k - 

9 

1 

v„(l) 

1 

.33 

.238 

.271 

.186 

2 

.578 

.211 

.488 

.177 

3 

.756 

.165 

.657 

.152 

4 

.875 

.151 

.764 

.122 

5 

.974 

.045 

.875 

.091 

6 

.984 

.023 

.936 

.063 

7 

.998 

.008 

.973 

.037 

s' 

.992 

.019 

9 “ 

.999 

.002 

C 

0 

Efficiency 

.958 

.841 

.978 

.85 
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Table  26.  Welbull  Distribution,  y * 2,  p Known; 

Asymptotically  Optimal  Spaclngs,  Coefficients, 
Correction  Factors  and  Efficiencies  for  Seven 
or  Nine  Order  Statlatlce 


1 

k - 

7 

k 

- 9 

"i 

W_,(l) 

w^(l) 

1 

.33 

.189 

.271 

.139 

2 

.578 

.204 

.488 

.16 

3 

.756 

• .18 

.657 

.155 

4 

.875 

.181 

• .784 

.136 

5 

.974 

.062 

.875 

.11 

6 

.984 

.032 

.936 

.081 

7 

.996 

.012 

.973 

.051 

8 

.992 

.028 

9 

.999 

.004 

C 

.861 

.864 

Efficiency 

.958 

.978 
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Table  27.  Wcibull  lUutrJbutlon,  y “ log(u)  . (3.2. SO) 

A8ymptotlc.nlly  Sp.nclnc8y  Coefficients 

Correction  Factors  .ind  Efficiencies  for  Seven 

or  Nine  Order  Statistics  <»^i>  be  shown  that 


k - 

7 

k 

- 9 

(3.2.51) 

i 

“1 

W_(l) 

"1 

W^(l) 

When  9 Is  known,  the  optimal  density  for  use  In  the 

estimation 

1 

.33 

.15 

.271 

.105 

of  p la 

2 

.578 

.196 

.488 

.145 

3 

.756 

.197 

.657 

.158 

. V 1 -Vi 

.784 

.151 

h*(u)  ■ ? U ^ 

(3.2.52) 

6 

.875 

.217 

5 

.974 

.066 

.875 

.132 

6 

.984 

.046 

.936 

.104 

An  explicit  expression  for  Is  readily  obtained  and 

is  found  to 

7 

.998 

.019 

,973 

.071 

be  of  the  form 

8 

.992 

.042 

» ' 

.999 

.006 

H*'’<u>  - u*  . 

(3.2.53) 

C 

.912 

.913 

Efficiency 

.958 

.978 

A graph  of  this  function  appears  In  Figure  9. 

The  perforaance  of  spaclngs  generated  by  (3.2.52)  is  coopered 
with  that  of  the  optimal  spacings  found  by  Hassanein  (1968)  in 
Table  23  for  k ■ 2,7»9.  The  asymptotically  optimal  spacings, 
coefficients,  and  correction  factors  for  seven  or  nine  order  static** 
tics  are  given  In  Table  29. 

3.2.8  Ganna  Distribution 

The  density  function  for  the  gamma  distribution  is 

«o<*>  ■ • 0<x<-  . (3.2.54) 


Efficiency  .8060  .8203  .968  .9693  .9791  .9798 
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Table  29.  Extreme  Value  Distribution,  o Known;  Asymptotically 
Optimal  SpaclnQs,  CocrCiclcnts  and  Correction 
Factors  for  Seven  or  Nine  Order  Statistics 


1 

k - 

7 

k 

- 9 

“i 

w^(l) 

“i 

w^(i) 

1 

.002 

.03 

.001 

.016 

2 

.016 

.096 

.008 

.056 

3 

.053 

.157 

.027 

.098 

4 

.125 

.199 

.064 

.133 

5 

.244 

.212 

.125 

.158 

6 

.422 

.187 

.216 

.168 

7 

.67 

.118 

.343 

.16 

8 

.512 

.131 

9 V 

.729 

.078 

c 

-.444 

-.437 

u 

where  p > 0 Is  known.  Denote  by  the  c.d.f.  corresponding  to 

As  In  the  ease  of  the  normal  distribution.  It  la  not  possible  to 

derive  an  explicit  formula  for  Q (u)  • F 'Nu)  for  all  valuea  of  p. 
0 0 

However,  Q exlsta  and  hence  its  valuea  and  the  valuea  of  the  f Q 
o 0^0 


lunccion  may  oe  calculated  through  numerical  procedure. 

When  V la  known,  apacings  taken  according  to  the  denalty 
2(1  ■ P) 


h*(u)  - 


q„(u) 


exp  (u)) 


/ Q^(t)  ^ »xp  {V3Q(t))(lt 


0.2.55) 


will  be  ..ymptotlcally  optimal  for  th.  c.tlmation  of  o.  Spacing, 
obtained  nsing  b*  have  been  computed  by  Saindal  (19b<i)  for  k ■ 1(1)10 
and  p ■ 2, 3, 4,5.  A graph  of  for  theae  aame  valuea  of  p is 
preaented  in  Figure  10. 

3.2.9  Lognormal  Dlatributlon 

Aa  in  prevloua  work,  denote  the  standard  normal  p.d.f . end  c.d.f. 
by  4 and  4 respectively.  For  the  lognormal  distribution,  the  neces- 
aary  functions  for  the  construction  of  optimal  densities  art: 


and 


fj(x)  - pdog  x)  . 0 < X < » , 0.2.56) 

Q^(u)  - exp  (♦**(u))  , 0.2.57) 

- ♦♦"’(u)  exp  {-  ♦"'(u))  . 


0...58) 


Table  30.  I.o('iioranl  nir.irlbuclon,  |(  Knowii;  AKy«{>foriCAny 
Opllnuil  Sp.*!!' i up, fi » Coofricicnts,  Corrci-.!  loit 
Factors  and  Kfficlcnclcs  for  Seven  or  Nine  Order 
Statistics 


Figure  11.  The  Functions  for  Various  Distributions 
In  tl»c  Case  chat  o ts  Known 


k - 7 k ■ 9 


1 

“i 

1 

.023 

.363 

.013 

.263 

2 

.121 

.446 

.074 

.368 

3 

.29 

.349 

.184 

.33 

4 

.5 

.223 

.334 

.253 

5 

.71 

.115 

.5 

.173 

6 

.879 

.043 

.666 

.107 

7 

.977 

.007 

.816 

.055 

a 

.926 

.02 

9 ' 

.987 

.003 

c 

0 

Efficiency 

.9637 

1.546 

.976 

1.571 

Extreae 


3.3  D«c«  Sutomorlcs  for  Large  Sanplo* 

An  Integral  pare  of  exploratory  data  analysis  Is  the  sutBuarlzatlon 
of  a data  batch  by  a few  select  order  statistics.  These  sutmarics 
are  used  to  estloate  location  and  scale  parasKMcrs  and/or  to  find 
re-expressions  (transformations)  of  the  data  to  other  distributional 
fonas  (often  normal).  Several  rules  of  thumb  regarding  the  selection 
of  order  statistics  for  sunssary  purposes  have  been  pioposed,  such  as 
using  the  median,  quartiles  and  extremes,  or  taking  the  median, 
quartlles,  •!  percentiles  percentiles. 

In  Sectlm  3.2  it  was  seen  that  the  optisial  placement  of  order 
statistics  for  location  and  scale  parameter  estimation  depends 
heavily  upon  the  assumptions  made  regarding  the  distributional  type 
of  the  data.  Thus  for  large  data  sets,  where  only  a small  portion 
of  the  saaqile  is  to  be  utilized  for  estimation  purposes,  rhe  use  of 
order  statistics  from  nonpar.^aetric  five  or  seven  number  data 
summaries  may  result  in  estimators  with  low  efficiencies.  A useful 
tool  would  be  a summary  technique  that,  perhaps  after  goodness  of  fit 
tests,  could  be  adapted  to  the  distributional  form  of  the  data. 

The  objective  of  this  section  is  to  suggest  such  an  adaptive  technique 
for  sumaarlzlng  large  data  sets  through  the  use  of  a few  order 
statistics. 

The  specification  of  an  adaptive  data  summary  rule,  in  the 
present  context,  may  be  considered  as  consisting  of  two  parts: 

(a)  a rule  of  thumb  regarding  the  selection  of  s set  of  order 
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statistics  for  a data  suiraury  and  (b)  the  specification  of  optimal 
subsets  of  these  order  statistics  for  use  in  the  estimation  of  p or  o 
for  various  location  and  scale  parameter  models.  In  this  section, 
adaptive  rules  based  on  19  or  21  order  statistics  and  considering 
subsets  of  size  seven  and  nine  respectively  will  be  constructed  using 
results  from  Section  3.2. 

Upon  review  of  the  asymptotically  optimal  spacings  for  seven  or 
nine  order  statistics  in  Section  3.2,  it  is  seen  that  these  spacinga 
tend  to  cluster  about  certain  points  in  the  interval  (0,1).  Since 

for  k • 7,  the  spacings  are  the  values  11*  ^ ^ ^ 

is  no  real  surprise  that  the  points  where  the  spacings  closer  are 
generally  multiples  of  ^ . Similarly  for  k ■ 9,  spacings  tend  to 
accumulate  about  multiples  of  . The  clustering  behavior  of  the 
spacings  may  be  used  to  Justify  the  order  statistics  placement  in 
the  19  and  21  number  data  summary  rules  that  follow. 

A 19  number  adaptive  data  summary  consists  of  the  following 
sample  quantiles: 

1.  The  median,  Q(.3). 

2.  Tlic  — percentile,  Q(.4375)  and  Q(.562S}. 

3.  The  ^ pcrr-entlles,  Q(.37S)  and  Q(.62S). 

4.  The  percentiles,  Q(.3125)  .*xnd  Q(.687S). 

lo 

5.  Tlic  quartilcs,  Q(.2S/  and  Q(.75). 

6.  The  percentiles,  Q(.197S)  and  Q(.B12S). 

7.  The  ^ percentiles,  Q(.12S)  and  Q(.873). 
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8.  Ttic  percentiles,  Q(.062S)  and  Q(.9373}. 

9.  The  — Q pwcentiles,  Q(.02)  and  Q(.98). 

1 - • 

10.  The  Yqq  percentiles,  Q(.Ol)  and  Q(.99). 

For  the  distributions  considered  in  Section  3.2,  Tables  31-32  Indlcste 
which  order  scstlstlcs  should  be  utilized  In  the  estimation  of  p or 
0 respectively  by  an  estimator  based  on  seven  sample  quantiles.  The 
selection  of  an  order  statistic  is  indicated  by  a check  mark  across 
from  Its  corresponding  spacing.  For  example,  for  the  exponential 
distribution  with  p known  Table  32  indicat'es  that  the  spacings  set 
(.3125,  .5625,  .75,  .875,  .9375,  .98,  .99)  should  be  utilized  for 
the  estimation  of  o. 

For  estimators  composed  of  nine  order  statistics,  s 21  nui^cr 
adaptive  data  sunnary  may  be  defined  as  consisting  of  the  following 
sample  quantiles: 

1.  Q(.5) 

2.  Q(.4).  Q(.6) 

3.  Q(.33).  Q(.66) 

4.  Q(.27),  Q(.73) 

5.  5<.20),  q(.eo) 

6.  Q(.17),  Q(.83) 

7.  Q(.IO),  Q(.90) 

8.  Q(.07).  Q(.93) 

9.  Q(.03),  Q(.97) 

10.  Q(.02).  Q(.98) 

11.  Q(.01>,  Q(.99) 


»62S 
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For  the  distributions  considered  In  Section  3.2,  Tables  33-34  indicate 
how  order  statistics  should  be  selected  for  the  estimation  of  y or  o. 
Table  35  contains  the  coefficients  (denoted  by  U),  correction 


Table  33.  Order  Statistic  Selection  for  Location 
Parameter  Estimation  by  Nine  Order 
Scatlsclca 


factors  (denoted  by  C)  and  efficiencies  for  the  spaclnss  sets  suggested 

Distribution 

In  Tables  31-34.  To  use  this  table,  arrange  the  spaclngs  under 

Spacing 

Normal 

Cauchy 

Logistic 

Extreste  Value 

consideration  in  Increasing  order,  * ^2  ^ ^ 

.01 

/ 

/ 

the  coefficient  corresponding  to  Is  W(i).  For  example,  in  the 

.02 

✓ 

case  of  the  exponential  distribution  with  y known,  an  explicit 

.03 

/ 

expression  for  the  estimator  of  o formed  from  7 order  statistics  is 

.07 

✓ • 

/ 

✓ 

.1 

. ✓ 

/ 

.17 

/ 

/ 

0 - .2906q(.3125)  + .2252q(.5625)  + .1587Q(.75)  + .0*98Q(.875) 

.2 

/ 

/ 

* .055Q(.9375)  + .0204q(.98)  + .0144Q(.99)  - .8S4y  . 

.27 

.33 

/ 

✓ 

/ 

(j.j.i) 

.4 

/ 

✓ 

.5 

/ 

✓ 

/ 

/ 

The  efficiency  of  this  estimator  is  .9653. 

.6 

/■ 

/ 

.67 

/ 

/ 

/ 

.73 

✓ 

.8 

/ 

.83 

✓ 

/ 

.9 

✓ 

.93 

/ 

✓ 

.97 

.98 

.99 

/ 

• 

( 
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tonentlal  Distribution 


Pareto  Distribution 
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Extreme  Value  Distribution, 


4.  SPACINCS  FOR  CENSOKFD  SAMPl.F^  ANt) 
QUANTILK  ESTIMATION 


The  RKllS  generated  by  K^,  H(K)»  conslete  of  differentiable 
functions.  For  f and  g In  H(K)  the  Inner  product  Is 


In  this  chapter,  techniques  slaillar  to  thmic  of  Section  2.5 
vlll  be  developed  for  spacing  selection  In  censored  ssaples.  The 
selection  of  order  statistics  for  the  optimal  cstlmotion  of  population 
quantiles  will  also  be  considered. 

4.1  Optimal  Spaclngs  for  Censored  Samples 


Estimating  location  and  scale  parameters  given  a censored  set 


of  order  statistics  <*•*  ^(^q)  easily  formulated  os 

using  the  sso^le  quantile  function,  Q,  over  the  Interval  (p,q)  c (0,1). 
It  can  be^ shown  (Parzen  (1979))  that  a model  for  location  and  scale 
parameter  estimation  In  this  ease  Is 


f,,Q^(u)Q(u)  • ♦ o«B(u)  , 


u c (p.q)  • 


(4.1.1) 


where  {B(u),  u c (p»q)}  Is  a Brownian  Ifrldge  process  on  (p,ql  with 
covariance  kernel 


Kj(u^,Oj)  - BlnCu^.Uj)  - , Uj.Uj  c (p.ql  , (4.1.2) 


end 


(4.1.3) 


A'(u)**(u)4u  ♦ . llaifiM  . 


(4.1.4) 


If  f c H(K)  U twlc.  dlffcrcntUbl.,  th<  r.ptoducln,  property 
end  Integration  by  parts  can  be  used  to  show  that  f has  the  repre* 
sentstlon 

f(o)  • - /'r(t)ICj(u,t)dt  + K,(u.p)li  f(p)  • f'(p)) 

+ K,(u.q)I  f(q)  + f'(q)]. 

(4.1.5) 


By  making  the  Identifications 


- i ((p)  - f(p)  . 

*2 -rh 


(4.1.6) 


Remark  2.3.5  of  Section  2.3  Is  seen  to  be  applicable.  The  next 
theorem  i^lch  Is  the  censored  sample  analogue  of  Theorem  2,5.1  follows 
Ismedlstely  from  this  fact. 

Theorem  4.1.1.  Suppose  the  soi^le  quantile  function,  ^(u).  Is 
available  over  the  Interval  (p,ql  ^ (0,1).  Then  the  following 


results  hold: 


1.  If  has  the  representation  (4.1. S)  on  [p»q]»  define 
the  density 

[f  Q„<u)"l’^’ 


h?  -(“)  • - 

P*^  A 


(4.1.7) 


P 


with  corresponding  c.d.f.  H*  The  spscings 

i ■ 1 k . are  asyuptotically  optinal  for  estiaating 

U when  0 is  known. 

2.  If  ****  representation  (4.1.5)  on  [p,qj,  define 


the  density 


K 

p»q 


/‘‘(IQ  (t)f  Q 


(4.1.8) 


i ■ 1,  ...•  k , are  asymptotically  optimal  for  estimating 


with  corresponding  c.d.f.  H*  The  spacings 


o when  u is  known. 

3.  If  both  f Q and  Q *1  Q admit  tlie  representation  (4.1.5) 
o o o o o 

on  (p»q)«  define  the  density  function 


h*  -(u)  ■ 

Pf<  A 


(A. 1.9) 


♦•(u)  ■ 


(A. 1.10) 


and 


P.9 


’<t  O .£  Q > 

' o'o'  o'o'p.q 

<f  Q .q  Q > 
ow  w*  ow^p»q 


0 0 0 0 o p^q 
o o o o o 0 p»q 


(4.1.11) 


Denote  by  H*  the  c.d.f.  corresponding  to  h*  The 
spacings  ^ ^ asymptotically 

optimal  for  simultaneous  estimation  of  m and  o. 

Theorem  4.1.1  provides  a solution  to  the  optimal  spacings 
problem  for  censored  samples.  The  corresponding  formulas  for  the 
estimators  of  y and  o based  on  asymptotically  optimal  spscings  can 
be  constructed  by  replacing  h*  by  h*  ^ and  H*  * ^ by 

«9"‘l0"‘  (2.5.20),  (2.S.22),  (2.S.2A)  and  (2.5.25) 

of  Section  2.5. 


4.2  Optimal  Spacings  for  Quantile  Estimation 
For  the  location  and  scale  parameter  model 
"(*>  - 


the  population  quantile  function.  Q,  ha.  th.  fora 


Q(u)  - u + oq  (u) 


(A.2.2) 


where  Q (u)  • F '(u)  . Hilo  section  will  address  the  problcn  of 
o o 

how  to  optimally  select  order  statistics  for  the  estimation  of  the 
population  quantile,  Q(p). 

First  observe  that  since  Q^(p)  ie  known,  Q(p)  is  a linear 
combination  of  Che  unknown  parameters  p and  o.  Thus  quantile 
csclnatlon  may  be  considered  as  a special  case  of  the  estimation  of 


For  a given  vector,  t*  • <£^,12)*  known  from  the  theory 

of  least  squares  (Craybill  (1976))  that 


I.C.,  Che  ABLUE  of  a linear  combination  of  the  parameters  is  the 
same  linear  combination  of  the  ABLUE's  of  M and  o.  Also  note  that 


V(tjl  + ^ t*A’*l 


. 2-  tr(A"‘u') 


where  A is  the  information  matrix  of  Section  5.2  and  tr  denotes  the 


choose  order  statistics  in  such  a Buxnncr  that  tr(A  tt*)  is  a minimum. 
Sacks  and  Ylviaaker  (1968)  have  derived  an  asymptotic  solution  to 
this  problem  that  m.'iy  be  tiscd  to  prove  the  next  theorem. 


Theorem  4.2. 1»  Let  I*  • (t2*^2^  * known  vector  of  constants  and 

define  4i(u)  as  in  (4.1.9).  Spaclncs  generated  by  the  density 


U'(u)U'*(u)]'3 

/*[9'(t)U'*(t)J*^5dt 

0 


will  be  asymptotically  optimal  for  the  estimation  of  * £2°* 

The  following  corollary  to  Theorem  4.2.1  details  an  asymptotic 
solution  to  the  problem  of  optimal  spacln'g  selection  for  quantile 


Corollary  4.2.1,  Let  Q have  the  form  (4.2.2)  and  let  p c (0,1)  be 
•pcclfied.  Define  the  density  function 


U’(u)M*(u)j' 


Lq„(p)  Q'(p)J 


and  9(u)  defined  as  in  (4.1.9).  The  sequence  of  spacings  sets 
generated  by  h*  la  asymptotically  optinuil  for  the  eatlmatlon  of  Q(p). 


ns 

5.  CONCLUSION 

5.1  Sunary 

A general  approach  to  obtaining  optimal  spacings  for  linear 
ayatcmatle  cetlsatora  of  location  and/or  scale  parameters  has  been 
formulated  in  this  dissertation.  By  treating  the  problem  of  location 
and  scale  parameter  estimation  by  linear  functions  of  order  statistics 
as  one  of  regression  analysis  of  a sample  quantile  process, 
it  was  found  Chat  the  optimal  spacings  problem  was  equivalent  to  a 
regression  design  problem.  This  approach  was  seen  to  have  advantages 
over  classical  techniques  in  that  it  provided  a unified  regression 
framework  for  optiMl  location  and  scale  parameter  estimation  and 
led  to  compucatlonally  simple  solutions  to  the  optimal  spacings 
problem. 

The  basic  theory  was  developed  in  Chapter  2 where  asymptotic 
results  regarding  designs  for  continuous  parameter  time  series  were 
employed  to  obtain  spacings  sets  that  were  asymptotic.illy  optimal. 

This  asymptotic  optimality  can  be  interpf^etcd  ss  meaning  that  the 
spacings  sets  result  in  nearly  optimal  efficiencies  as  the  number  of 
spacings  Included  in  the  sets  becomes  large. 

Tlu*  theory  dovolopcil  Jn  Clmpicr  2 wai;  applied  to  Hcvcral  common 
distributional  forms  in  Chapter  3.  The  asymptotically  optimal 
spacings  sets  were  seen  to  give  nearly  optimal  efficiency  for  set  size 
as  small  as  seven  or  nine.  Further,  the  propensity  of  these  spacings 


wlLu 
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to  cluster  about  certain  values  made  it  possible  to  propose  some 
adaptive  procedures  for  suauirlsing  large  data  sets  with  a fev  order 
statistics. 

In  Chapter  4,  an  analogue  of  the  asymptotic  theory  for  optimal 
spacings  selection  in  uncensored  samples  was  developed  for  the  case 
of  censored  sai^>lea.  Asymptotically  optimal  spacings  for  population 
quantile  estimation  were  also  obtained. 

5.2  Problems  for  Further  Research 

Several  problems  arise  in  the  application  of  the  theory  developed 
in  Chapter  2 due  to  the  Integral  representation  assumed  for  the 
f^Q^  and  Q^'fgQp  functions.  For  this  reason  an  approach  to  optimal 
spacings  selection  for  functions  that  can  only  claim  membership  in 
the  RKllS  generated  by  would  be  worthwhile. 

There  are  several  cases  where  either  f Q “ or  (Q  * f Q )"  behave 

0 0 0 o o 

so  poorly  at  zero  and/or  one  that  they  fall  to  be  integrable  on  the 
closed  interval  [0,1].  An  obvious  procedure  in  this  case  would  be 
to  use  an  appropriate  subincerval  [p.q]  of  [0,1]  and  employ  the 
results  of  Section  4.1  to  obtain  spacings.  However,  this  approach 
seems  to  be  quite  sensitive  to  the  choices  for  p and  q.  Thus  tech'> 
nlqucN  for  selecting  p nnd  q In  an  optimal  manner  would  be  quite 
useful. 

An  extension  of  the  results  of  Chapter  3 to  other  distributions 


end  estimation  situations  is  needed.  Of  particular  interest  la 
whether  the  placement  of  order  etetiatica  suggested  in  Section  3.3 
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will  suffice  for  a still  wider  ranee  of  dictributlon  typos  than  those 
for  which  it  was  constructed. 
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